AIDA GELINA BRIKEN nToF CRIB ISOLDE CIRCE nTOFCapture DESPEC DTAS EDI_PSA 179Ta CARME StellarModelling DCF K40
  AIDA, Page 30 of 46  ELOG logo
ID Date Author Subject
  338   Fri Aug 19 09:43:27 2016 TD, VP, reply PJCSFriday 19 August
> 
> 14.00 Both AIDA NIM bins U/S 
>       Upper NIM bin - BNC PB-4, Ortec 433, 2x CAEN N1419, 1x CAEN N1419B - ac mains fuse 100V/8A/slow? blowing
>       Lower NIM bin - 11x MACB - not powering up
> 
>       Replaced both NIM bins.
>       Upper NIM bin - 11x MACB
>       Lower NIM bin - BNC PB-4, Ortec 433, 2x CAEN N1419, 1x CAEN N1419B
> 
> 15.50 Julabo FL11006 chiller ON - set point +20 deg C
>       B2F/F11 ambient temp +24.0 deg C, RH 54%, DP +13.8 deg C
> 
>       Bias 6x MSL type BB18 DSSSDs - see attachments 1-2
> 
> 16.10 Startup DAQ nnaida1-24 *except* nnaida19
> 
>       System-wide checks
> 
>       Check clock status OK
>       Check ADC calibration - nnaida7 failed
>       Check ASIC clock timestamp - multiple failures - see attachment 3
>       Check SYNC errors not counted Since Baseline - see attachment 4
>       Check SYNC pulses received - see attachment 5
> 
>       BNC PB-4 
> 
>       Amplitude 90,000, x5 attenuator IN
>       t_r 50ns, tau_d 50us
>       Polarity -
>       INT ref, delay MIN, tail pulse
> 
>       FEE64 temperatures - see attachment 6
> 
> 16.34 R1 starts (directory Aug2016)
> 
>       slow comparator 0x10, LEC/MEC fast comparator 0x10
> 
>       merge 555k data items/s, tapeserver 4430 kb/s
Looks like a problem with the inter-MACB wiring.
The screen shots show groups of 4 with the problems.
I suggest re-seating the short HDMI cables and reset the system.
  337   Fri Aug 19 07:42:22 2016 TD, PVFriday 19 August
14.00 Both AIDA NIM bins U/S 
      Upper NIM bin - BNC PB-4, Ortec 433, 2x CAEN N1419, 1x CAEN N1419B - ac mains fuse 100V/8A/slow? blowing
      Lower NIM bin - 11x MACB - not powering up

      Replaced both NIM bins.
      Upper NIM bin - 11x MACB
      Lower NIM bin - BNC PB-4, Ortec 433, 2x CAEN N1419, 1x CAEN N1419B

15.50 Julabo FL11006 chiller ON - set point +20 deg C
      B2F/F11 ambient temp +24.0 deg C, RH 54%, DP +13.8 deg C

      Bias 6x MSL type BB18 DSSSDs - see attachments 1-2

16.10 Startup DAQ nnaida1-24 *except* nnaida19

      System-wide checks

      Check clock status OK
      Check ADC calibration - nnaida7 failed
      Check ASIC clock timestamp - multiple failures - see attachment 3
      Check SYNC errors not counted Since Baseline - see attachment 4
      Check SYNC pulses received - see attachment 5

      BNC PB-4 

      Amplitude 90,000, x5 attenuator IN
      t_r 50ns, tau_d 50us
      Polarity -
      INT ref, delay MIN, tail pulse

      FEE64 temperatures - see attachment 6

16.34 R1 starts (directory Aug2016)

      slow comparator 0x10, LEC/MEC fast comparator 0x10

      merge 555k data items/s, tapeserver 4430 kb/s

18.28 R4 starts

      Settings per R1

18.31 DAQ/merge stalls with (many) messages as follows

MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
Merge state now Paused
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in pending data, found 1146,
last 575
copyItems: queue 14, maybe copied in wrong buffer: sequence 1146, last 575
Merge_addItems: queue 14, maybe added wrong buffer: sequence 1146, last 575
MERGE Program (24316): check_for_data: index 14, unexpected header sequence number in new data, found 689, last 1258
copyItems: queue 14, maybe copied in wrong buffer: sequence 689, last 1258
Merge_addItems: queue 14, maybe added wrong buffer: sequence 689, last 1258
MERGE Data Link (24254): client closed connection
MERGE Data Link (24254): listening on port 11001
MERGE Data Link (24255): client closed connection
MERGE Data Link (24255): listening on port 11002
MERGE Data Link (24257): client closed connection
MERGE Data Link (24257): listening on port 11004
MERGE Data Link (24258): client closed connection
MERGE Data Link (24258): listening on port 11005
MERGE Data Link (24261): client closed connection
MERGE Data Link (24261): listening on port 11008
MERGE Data Link (24260): client closed connection
MERGE Data Link (24260): listening on port 11007
MERGE Data Link (24259): client closed connection
MERGE Data Link (24259): listening on port 11006
MERGE Data Link (24256): client closed connection
MERGE Data Link (24256): listening on port 11003
MERGE Data Link (24262): client closed connection
MERGE Data Link (24262): listening on port 11009
MERGE Data Link (24263): client closed connection
MERGE Data Link (24263): listening on port 11010
MERGE Data Link (24265): client closed connection
MERGE Data Link (24265): listening on port 11012
MERGE Data Link (24266): client closed connection
MERGE Data Link (24266): listening on port 11013
MERGE Data Link (24267): client closed connection
MERGE Data Link (24267): listening on port 11014
MERGE Data Link (24269): client closed connection
MERGE Data Link (24269): listening on port 11016
MERGE Data Link (24268): client closed connection
MERGE Data Link (24268): listening on port 11015
MERGE Data Link (24270): client closed connection
MERGE Data Link (24270): listening on port 11017
MERGE Data Link (24264): client closed connection
MERGE Data Link (24264): listening on port 11011
MERGE Data Link (24273): client closed connection
MERGE Data Link (24273): listening on port 11020
MERGE Data Link (24271): client closed connection
MERGE Data Link (24271): listening on port 11018
MERGE Data Link (24277): client closed connection
MERGE Data Link (24277): listening on port 11024
MERGE Data Link (24275): client closed connection
MERGE Data Link (24275): listening on port 11022
MERGE Data Link (24274): client closed connection
MERGE Data Link (24274): listening on port 11021
MERGE Data Link (24276): client closed connection
MERGE Data Link (24276): listening on port 11023
Attachment 1: 1.png
1.png
Attachment 2: 2.png
2.png
Attachment 3: 3.png
3.png
Attachment 4: 4.png
4.png
Attachment 5: 5.png
5.png
Attachment 6: 6.png
6.png
  336   Wed Aug 17 14:44:52 2016 Patrick Coleman-Smith[HowTo] start acquisition using the new "Start ASIC Readout" button

Due to sometimes operating with very noisy systems, both at Daresbury and RIKEN, I have installed a change to the startup of the system.

The Runcontrol GO button now only starts the output of SYNC pulses. A further button labelled "Start ASIC Readout" ( see attached photo ) needs to be pressed to start the ASIC ADC, Discriminator and Correlation readout.

After carrying out the normal operations to start the system and pressing the GO button the Merger can be checked to be certain it is merging the events. Since there are only SYNC pulses it should soon be apparent if this is not happening and appropriate actions can be taken. ( Kill/Reload the Link and Merger processes using the "Merger for AIDA" icon at the top of the screen is one way. )

Once it is certain the merging is in progress then the new button labelled "Start ASIC Readout" ( see attached photo ) needs to be pressed to start the ASIC ADC, Discriminator and Correlation readout.

This must be done after every GO.

Attachment 1: RunControl_portion.png
RunControl_portion.png
  335   Wed Aug 17 14:10:05 2016 Patrick Coleman-SmithLinux Memory problem solutions for FEE64

The problem indicated by "page allocation failure" and the kernal subsequently "killing" the AidaExecV8 process is due ( it appears ) to memory availability.

Solution 1:- Reduce the amount of memory used for histogramming. That would be by disabling the .V histograms or changing the size of the .H and/or .L histograms from 65536 to 32768 and using the shift attribute in the Options. This will reduce the memory required.

Solution 2:- Change a kernal setting in the /etc/sysctl.conf to request there is always a minimum amount of free memory. Insert the line "vm.min_free_kbytes=4096" in the file.

Solution 2 has been tried on the system in T9 while the flag enabling raw data to be written to disc is set. Previous to the solution, with the same operating conditions,  the fault would occur after about 15 minutes. Subsequent to the solution the system has run for 46 hours.

I propose that the .V histograms are disabled and the kernal setting is used. ( both solutions :-) The .V histograms are a late addition and not required at present.

 

  334   Mon Aug 15 09:55:08 2016 Patrick Coleman-SmithFound a candidate for DAQ failure.

The problem :-

The DAQ stops working and can only be fixed by a reboot of the FEE64s.

The Candidate:-

The DAQ program running in the FEE64s which handles the transfer of the data items from the FPGA to the Merger, AidaExecV8, uses a device driver, aidamem, to copy the data from DMA memory into Linux memory.

Sometimes AidaExecV8 can be killed by the Kernal due to a "page allocation failure" which occurs when the Kernal memory space has become fragmented. So when a block of contiguous memory is requested to receive the copy of the data from the FPGA DMA memory the Kernal can't allocate memory and kills the process.

The effect of this would be seen at the Merger where it would be waiting for data from the FEE.

Since the AidaExecV8 has been killed then there will be no response to status requests.

A recent change to the operation of the Aida system , flushing buffers regularly, is the most likely cuplrit. The flush of the buffers on a slow FEE will use small memory blocks while faster FEE data will always require full size buffers and mean that the Kernal memory will have large contiguous areas of memory in constant use. This will explain why this failure type is relatively recent .

A solution :- Change the request for memory copy to always use the maximum size. This way there shouldn't be the fragmentation of the memory. Also the fragmentation of memory will be investigated to see if there is a away to monitor it.

 

Some corroboration:-

The error messages from the Kernal that indicate this has fault has occured can be read from the FEE64 root file system. At the text file /var/log/messages. A grep of these in RIKEN using the phrase "page allocation failure" shows they have occured. It remains to see if the date and times of the failures align with the date and times of the system failure.

 

All comments and ideas gratefuly received.

###### # ##  # # # # #   Tested this solution on T9 system and it doesn't work ! The failure still occurs when the DAQ is set to write all input to disc.

  333   Mon Aug 1 11:30:07 2016 Patrick Coleman-SmithFirst time remotely power up HV in RIKEN from Daresbury

Logged into aidas1 and opened the nnrpi1 relay control window ... powered up the FEE64s.

seperatly connected to ribfdaq and opened a firefox to aidas1:8015.

By the time the firefox had appeared on my desktop the FEE64s had started up.

Ran through the standard startup. Set thresholds to 255 for all 3 sets of comparators. Disabled all discriminators.

Enable histograms.

Started DAQ.

Using ELOG 74 as a guide logged into nnrpi1 and ran two putty sessions ( USB2 and USB3 ).

Based on the instructions in the HV unit manual attached to ELOG75 powered ion all 6 detectors ( current when settled at 100v = 3 to 5uA )

Using spectrum broswer at nnaida6 and ASIC4 to control the slow comparator threshold ..... needed to reach 32 before the pulser became visible.

Using nnaida6:1.3.L found the pulser peak at channel 32136 with a peak width of 67.73 channels.

Then poweroff the HV on all 6 detectors.

Power off the FEE64s.

Logout of all RIKEN machines and consider the best way to take advantage of the situation which allows the system to be operated completely remotely.

 

 

  332   Sun Jul 31 09:38:07 2016 TDAnalysis of AIDA75_1
Attachment 2 - analysis from *end* of file AIDA75_1

2D spectrum - FEE # versus timestamp (100us/channel)

1D spectrum - projection of timestamp data for FEE # = 6

Conclude data synchronised at beginning and end of AIDA75_1. 
Period of pulser data ~36ch = 3.6ms => pulser frequency ~ 280Hz

Attachment 3 - analysis around ts=0x18bacd75c4 +/- 20000000 (dec)

Conclude FEE # 6 stops producing ADC data c. ch 353

Attachment 4 - analysis around ts=0x18cac6144b +/- 20000000 (dec)

Conclude FEE # 6 starts producing ADC data c. ch 28

Spectrum s4100 y-channel calculated as follows

C        IF ( ts.GE.z'18bacd75c4' - 20000000
C----67---
C     +       .AND.
C     +       ts.LE.z'18bacd75c4' + 20000000 ) THEN
        IF ( ts.GE.z'18cac6144b' - 20000000
C----67---
     +       .AND.
     +       ts.LE.z'18cac6144b' + 20000000 ) THEN
C
C
C       Increment ts spectra per FEE64 module

        j = MOD( INT( ts*10.0D-09*1.0D+04 ), 4096 )
        IF ( j.EQ.0 ) THEN
         DO k = 0, 4095
          DO l = 1, 31
           CALL set2d( 4100, l, k, 0 )
          ENDDO
         ENDDO
        ENDIF
        CALL inc2d( 4100, module, j )

        ENDIF
Attachment 1: AIDA75_1.txt
[td@localhost ~/GREAT]$ ./analyser2 /media/1E47-5D81/RIKEN/NP1512/AIDA75_1
 *** GREAT format 3.2.0 analyser - TD - May 2014
 *** ERROR: READ I/O error:       5002
                   blocks:      32000
          ADC data format:  229487868 (  519604.0 Hz)
        Other data format:   31632132 (   71621.1 Hz)
 Sample trace data format:          0 (       0.0 Hz)
         Undefined format:          0 (       0.0 Hz)
   Other data format type:      PAUSE:          0 (       0.0 Hz)
                               RESUME:          0 (       0.0 Hz)
                              SYNC100:     168477 (     381.5 Hz)
                           FEE64 disc:   31101492 (   70419.7 Hz)
                             MBS info:     362163 (     820.0 Hz)
                           Other info:          0 (       0.0 Hz)

   ADC data range bit set:    1805055 (    4087.0 Hz)

                Timewarps:        ADC:          0 (       0.0 Hz)
                                PAUSE:          0 (       0.0 Hz)
                               RESUME:          0 (       0.0 Hz)
                              SYNC100:          0 (       0.0 Hz)
                           FEE64 disc:          0 (       0.0 Hz)
                             MBS info:          0 (       0.0 Hz)
                            Undefined:          0 (       0.0 Hz)
                         Sample trace:          0 (       0.0 Hz)

 Timestamp elapsed time:      441.659 s
 FEE module #:   1 elapsed dead time       0.000 s
 FEE module #:   2 elapsed dead time       0.000 s
 FEE module #:   3 elapsed dead time       0.000 s
 FEE module #:   4 elapsed dead time       0.000 s
 FEE module #:   5 elapsed dead time       0.000 s
 FEE module #:   6 elapsed dead time       0.000 s
 FEE module #:   7 elapsed dead time       0.000 s
 FEE module #:   8 elapsed dead time       0.000 s
 FEE module #:   9 elapsed dead time       0.000 s
 FEE module #:  10 elapsed dead time       0.000 s
 FEE module #:  11 elapsed dead time       0.000 s
 FEE module #:  12 elapsed dead time       0.000 s
 FEE module #:  13 elapsed dead time       0.000 s
 FEE module #:  14 elapsed dead time       0.000 s
 FEE module #:  15 elapsed dead time       0.000 s
 FEE module #:  16 elapsed dead time       0.000 s
 FEE module #:  17 elapsed dead time       0.000 s
 FEE module #:  18 elapsed dead time       0.000 s
 FEE module #:  19 elapsed dead time       0.000 s
 FEE module #:  20 elapsed dead time       0.000 s
 FEE module #:  21 elapsed dead time       0.000 s
 FEE module #:  22 elapsed dead time       0.000 s
 FEE module #:  23 elapsed dead time       0.000 s
 FEE module #:  24 elapsed dead time       0.000 s
 FEE module #:  25 elapsed dead time       0.000 s
 FEE module #:  26 elapsed dead time       0.000 s
 FEE module #:  27 elapsed dead time       0.000 s
 FEE module #:  28 elapsed dead time       0.000 s
 FEE module #:  29 elapsed dead time       0.000 s
 FEE module #:  30 elapsed dead time       0.000 s
 FEE module #:  31 elapsed dead time       0.000 s
 FEE module #:  32 elapsed dead time       0.000 s

 *** Program elapsed time:  347.762s (   92.017 blocks/s,   5.751 Mb/s)

Attachment 2: Untitled.png
Untitled.png
Attachment 3: 200.png
200.png
Attachment 4: 400.png
400.png
  331   Thu Jul 28 11:35:58 2016 Patrick Coleman-Smith[HowTo] Apply Thermal paste to FPGA
Attachment 1: Process_for_applying_thermal_paste_to_the_FPGA_in_FEE64.pdf
Process_for_applying_thermal_paste_to_the_FPGA_in_FEE64.pdf Process_for_applying_thermal_paste_to_the_FPGA_in_FEE64.pdf Process_for_applying_thermal_paste_to_the_FPGA_in_FEE64.pdf Process_for_applying_thermal_paste_to_the_FPGA_in_FEE64.pdf Process_for_applying_thermal_paste_to_the_FPGA_in_FEE64.pdf
  330   Mon Jul 25 08:38:56 2016 TDAnalysis of R4
Attachment 1: R4.out
[td@localhost ~/GREAT]$ ./analyser2 /media/1E47-5D81/RIKEN/July2016/R4_0
 *** GREAT format 3.2.0 analyser - TD - May 2014
 *** ERROR: READ I/O error:       5002
                   blocks:      32000
          ADC data format:  259005266 ( 1007210.1 Hz)
        Other data format:    2082734 (    8099.3 Hz)
 Sample trace data format:          0 (       0.0 Hz)
         Undefined format:          0 (       0.0 Hz)
   Other data format type:      PAUSE:       1991 (       7.7 Hz)
                               RESUME:       1971 (       7.7 Hz)
                              SYNC100:    2078772 (    8083.9 Hz)
                           FEE64 disc:          0 (       0.0 Hz)
                             MBS info:          0 (       0.0 Hz)
                           Other info:          0 (       0.0 Hz)

   ADC data range bit set:      11844 (      46.1 Hz)

                Timewarps:        ADC:          0 (       0.0 Hz)
                                PAUSE:          0 (       0.0 Hz)
                               RESUME:          0 (       0.0 Hz)
                              SYNC100:          0 (       0.0 Hz)
                           FEE64 disc:          0 (       0.0 Hz)
                             MBS info:          0 (       0.0 Hz)
                            Undefined:          0 (       0.0 Hz)
                         Sample trace:          0 (       0.0 Hz)

 Timestamp elapsed time:      257.151 s
 FEE module #:   1 elapsed dead time      16.704 s
 FEE module #:   2 elapsed dead time      20.156 s
 FEE module #:   3 elapsed dead time      20.977 s
 FEE module #:   4 elapsed dead time      20.836 s
 FEE module #:   5 elapsed dead time      19.748 s
 FEE module #:   6 elapsed dead time      21.220 s
 FEE module #:   7 elapsed dead time      20.842 s
 FEE module #:   8 elapsed dead time      21.020 s
 FEE module #:   9 elapsed dead time      19.873 s
 FEE module #:  10 elapsed dead time      19.688 s
 FEE module #:  11 elapsed dead time      21.642 s
 FEE module #:  12 elapsed dead time      20.179 s
 FEE module #:  13 elapsed dead time      17.641 s
 FEE module #:  14 elapsed dead time      19.365 s
 FEE module #:  15 elapsed dead time      20.292 s
 FEE module #:  16 elapsed dead time      20.291 s
 FEE module #:  17 elapsed dead time      18.133 s
 FEE module #:  18 elapsed dead time      16.689 s
 FEE module #:  19 elapsed dead time       0.000 s
 FEE module #:  20 elapsed dead time      20.003 s
 FEE module #:  21 elapsed dead time      19.553 s
 FEE module #:  22 elapsed dead time      19.721 s
 FEE module #:  23 elapsed dead time      19.151 s
 FEE module #:  24 elapsed dead time      18.508 s
 FEE module #:  25 elapsed dead time       0.000 s
 FEE module #:  26 elapsed dead time       0.000 s
 FEE module #:  27 elapsed dead time       0.000 s
 FEE module #:  28 elapsed dead time       0.000 s
 FEE module #:  29 elapsed dead time       0.000 s
 FEE module #:  30 elapsed dead time       0.000 s
 FEE module #:  31 elapsed dead time       0.000 s
 FEE module #:  32 elapsed dead time       0.000 s

 *** Program elapsed time:  343.412s (   93.183 blocks/s,   5.824 Mb/s)
[td@localhost ~/GREAT]$ ./analyser2 /media/1E47-5D81/RIKEN/July2016/R4_244
 *** GREAT format 3.2.0 analyser - TD - May 2014
 *** ERROR: READ I/O error:       5002
                   blocks:       9200
          ADC data format:   18124592 (  988093.9 Hz)
        Other data format:   56938016 ( 3104075.8 Hz)
 Sample trace data format:          0 (       0.0 Hz)
         Undefined format:          0 (       0.0 Hz)
   Other data format type:      PAUSE:       5270 (     287.3 Hz)
                               RESUME:       5270 (     287.3 Hz)
                              SYNC100:       6956 (     379.2 Hz)
                           FEE64 disc:   56920520 ( 3103122.0 Hz)
                             MBS info:          0 (       0.0 Hz)
                           Other info:          0 (       0.0 Hz)

   ADC data range bit set:        955 (      52.1 Hz)

                Timewarps:        ADC:          0 (       0.0 Hz)
                                PAUSE:          0 (       0.0 Hz)
                               RESUME:          0 (       0.0 Hz)
                              SYNC100:          0 (       0.0 Hz)
                           FEE64 disc:          0 (       0.0 Hz)
                             MBS info:          0 (       0.0 Hz)
                            Undefined:          0 (       0.0 Hz)
                         Sample trace:          0 (       0.0 Hz)

 Timestamp elapsed time:       18.343 s
 FEE module #:   1 elapsed dead time      17.129 s
 FEE module #:   2 elapsed dead time      17.600 s
 FEE module #:   3 elapsed dead time      17.132 s
 FEE module #:   4 elapsed dead time      17.292 s
 FEE module #:   5 elapsed dead time      17.896 s
 FEE module #:   6 elapsed dead time      17.700 s
 FEE module #:   7 elapsed dead time      16.995 s
 FEE module #:   8 elapsed dead time      17.015 s
 FEE module #:   9 elapsed dead time      17.739 s
 FEE module #:  10 elapsed dead time      17.503 s
 FEE module #:  11 elapsed dead time      16.615 s
 FEE module #:  12 elapsed dead time      16.941 s
 FEE module #:  13 elapsed dead time      14.707 s
 FEE module #:  14 elapsed dead time      17.509 s
 FEE module #:  15 elapsed dead time      16.625 s
 FEE module #:  16 elapsed dead time      16.813 s
 FEE module #:  17 elapsed dead time      17.163 s
 FEE module #:  18 elapsed dead time      17.221 s
 FEE module #:  19 elapsed dead time       0.000 s
 FEE module #:  20 elapsed dead time      17.184 s
 FEE module #:  21 elapsed dead time      17.588 s
 FEE module #:  22 elapsed dead time      17.028 s
 FEE module #:  23 elapsed dead time      16.955 s
 FEE module #:  24 elapsed dead time      16.646 s
 FEE module #:  25 elapsed dead time       0.000 s
 FEE module #:  26 elapsed dead time       0.000 s
 FEE module #:  27 elapsed dead time       0.000 s
 FEE module #:  28 elapsed dead time       0.000 s
 FEE module #:  29 elapsed dead time       0.000 s
 FEE module #:  30 elapsed dead time       0.000 s
 FEE module #:  31 elapsed dead time       0.000 s
 FEE module #:  32 elapsed dead time       0.000 s

 *** Program elapsed time:   98.121s (   93.762 blocks/s,   5.860 Mb/s)
  329   Wed Jul 20 18:41:55 2016 DKAIDA topped up on water

 I actually topped off the chiller with water tonight as I am returning to Scotland (and then off to CERN) within hours.

I added 4 liters on around July 5th, to move the marker from a little under the halfway mark to over the halfway mark.  After 2.x weeks, it was still sitting a bit shy of under the halfway mark.  This means that in a humid and hot Japanese month like July, AIDA doesn't evaporate more than something like 2 liters in one week. 

As it's now topped off, I think it's good to go for something like several months.

I also note that I use a faucet in the old Nishina Center, rather than passing radiation control, to find a water tap.  It's maybe more of a hike but it avoids going through radiation control.

Anyway, the point of my elog post is that the AIDA chiller will not be running out of water any time soon, probably not before October, though local members at RIKEN might check it each month etc, or on request.  I left the bucket and the large-volume container right sitting by it.  I still don't know where the funnel is, so I robbed (borrowed) a kind of 1 liter handled jug with a beaker from a place, and returned it.

  328   Mon Jul 18 12:00:16 2016 Patrick Coleman-SmithReport of remote operation of Aida at RIKEN 11/7 to 14/7

11th July ---------------------------------------------------------------------------------------------------------

09:32 UK time, logged in and power-up all FEEs OK. Setup fine, Temperature Max 12@64

System Checks all fine SYNC count 2243584.

Start Merger and Toggle Pause for no merging.

DAQ start with output enabled.

Toggle Pause in Merger -> “Want First SYNC” with no change.

Tried this several times with the same result.

Checked DAQ statistics => Buffers/sec = 50 to 65, SYNC/sec = 40 to 80  ( should be 384 )

Disabled all ASIC readout and disable all discriminators. DAQ still in GO state.

Fetched new version of ASIC4 with Enable/Disable ASIC readout commands in the “Expert” menu.

SYNC statics now all FEEs are 384/sec.

Merger … Toggle Pause and its fine. 334 items/sec.

DAQ : ASIC4 : Set all slow comparator Thresholds to 250 : Enable ASICs for readout.

Merger now 532811 items/sec.

DAQ SYNC statistics range 307 to 390/sec

LT 22:36 System Checks => Sync counts 6518784 all ok, Sync errors all passed.

LT 23:30 Merger 539826 items/sec. Temperatures => 12@68.12 ; SYNC statistics 361 to 385 ; SYNC counts 8099840 all ok ; Sync errors all ok ; ASIC clock timestamp all ok.

STOP then Power-down all fine.

13th July ----------------------------------------------------------------------------------------------------------------------

Log in and power-up fine.

Backup Merger file ( merge.AD )  and copy latest version from DL. Won’t copy as file already open.

So instead …. Disable ASIC readout, disable Discriminators and run the system.

When Merger is operating , Enable ASIC readout => Merger rate is 8,058,412 items/sec.

LT 19:44 => Temperatures max 12@68.81 , SYNC received 845824 all ok; Sync errors all ok.

LT 20:11 => Merge rate 7,991,198 items/sec, SYNC received 1551359 all ok; Sync errors all ok.

LT 21:07 => Temperatures Max 12@69.00 ; SYNC received 2742272 all.

LT 23:20 => Stopped the whole system.

Kill the Merge process and the Startup process. Copy new Merge.AD successful. Edit Merger startup script to include netint OutputAll 1.

Start up Merger using /MIDAS/Merger/AIDA/Startup in terminal connected to aidas1.

Merger : Setup and configure ; Go ; Toggle Pause All Ok.

Go DAQ   - disable all ASIC readout.

Merger : Toggle Pause ; Merging 8832 items/sec ( = 384 x 23 …. OK )

Connect to TapeServer setup for R3 into /TapeData/July2016 , Go Tape ok.

Merger : Toggle Transfer.

LT 00:08 => enable ASICS for readout. Run for a short while. Merge spinner hesitates for 2 seconds at a time. Merge rate 5,951,471 items/sec. Tape server rate 3,5148 Kbytes/sec.

Stop system, SYNC received 8085504 all ok ; sync error all ok. Power down. Sftp tape data files to nndhcp052 in Daresbury.

14th July -------------------------------------------------------------------------------------------------------------------------

Startup and run the system to tape again R4, ASICs with no discriminators. All fine.

LT 20:19 Enable Discriminators. Merger rate 3.9Mi/sec ( Mega items/second)

LT 20:57 Temperature Max 12@69 , Merge rate 4.2Mi/sec

UK 14:49 Network failure between Daresbury and Edinburgh. Lost contact repeatedly 4 times in a row. Contacted DL network support. No local networking faults.

LT 23:05 Re-connected to Aida in RIKEN. Merger 3.3Mi/sec.

Decided to stop due to concerns about flaky network and leaving the system powered up over the weekend.

SYNC received 7222272 all ok; sync error ok.

Power down. R4_244 is the last file. Written.

 

  327   Thu Jul 14 16:55:32 2016 Patrick Coleman-SmithChanges to AIDA software

An updated copy of the merger has been downloaded ( merger64.AD ) with a copy of /MIDAS/Merger/AIDA/Startup.

These two enable the output of all of the SYNC data items to the TapeServer.

 

The System Wide Checks browser window has been updated and includes a report of the number SYNC pulses sent by the Master timestamp module and the number of SYNC pulses received by the Slave modules. These should all be the same value.

 

The ASIC4 browser window has been updated to add two actions to the Experts only menu. "Disable all ASICs For Readout"  and "Enable all ASICs for Readout". These two are useful when starting a noisy system. The sequence would be to start the Merger, toggle Pause. Then start the DAQ ( FEE64s ). "Disable all ASICs For Readout" then in the Merger toggle Pause. I found this made starting the merging much more successful. Then "Enable all ASICs for Readout" to run. I shall add similar actions to the Discriminators but will ensure the Mask Patterns set by the Restore are maintained and restored by the "Enable" action.

Before adding this capability it was quite hard to tune the system to guarantee the Merger a full set of SYNC data items at any one time. I have been running the FEE64s with low thresholds, detectors and no HV.

 

  326   Fri Jul 8 18:54:47 2016 TDAnalysis July2016/R1 data
See https://elog.ph.ed.ac.uk/AIDA/324

Analysis of first and last data file - see attachments 1-2
Rates per FEE64 ~20k data items/s

analyser2.f program - see attachment 3

To compile the analyser2.f program (SL6)

gfortran analyser2.f -o analyser2

analyser2 <filename> to produce summary statistics
analyser2 v <filename> to produce verbose analysis

ADC multiplicity as a function of time (1 channel = 0.1ms) for file R1_114 - see attachment 4
Pulser running at ~270Hz clearly identifiable

Conclusion
DAQ ran overnight without stalling
DAQ did not lose synchronisation

Decrease comparator thresholds to increase rates to experiment-realistic levels ~50k data items/s/FEE64
Attachment 1: R1_0.out
[td@localhost ~/GREAT]$ ./analyser2 /media/1E47-5D81/RIKEN/July2016/R1_0
 *** GREAT format 3.2.0 analyser - TD - May 2014
 *** ERROR: READ I/O error:       5002
                   blocks:      32000
          ADC data format:  260931056 (  524366.1 Hz)
        Other data format:     180163 (     362.1 Hz)
 Sample trace data format:          0 (       0.0 Hz)
         Undefined format:          0 (       0.0 Hz)
   Other data format type:      PAUSE:          0 (       0.0 Hz)
                               RESUME:          0 (       0.0 Hz)
                              SYNC100:     179382 (     360.5 Hz)
                           FEE64 disc:        781 (       1.6 Hz)
                             MBS info:          0 (       0.0 Hz)
                           Other info:          0 (       0.0 Hz)

   ADC data range bit set:        717 (       1.4 Hz)

                Timewarps:        ADC:          0 (       0.0 Hz)
                                PAUSE:          0 (       0.0 Hz)
                               RESUME:          0 (       0.0 Hz)
                              SYNC100:          0 (       0.0 Hz)
                           FEE64 disc:          0 (       0.0 Hz)
                             MBS info:          0 (       0.0 Hz)
                            Undefined:          0 (       0.0 Hz)
                         Sample trace:          0 (       0.0 Hz)

 Timestamp elapsed time:      497.612 s
 FEE module #:   1 elapsed dead time       0.000 s
 FEE module #:   2 elapsed dead time       0.000 s
 FEE module #:   3 elapsed dead time       0.000 s
 FEE module #:   4 elapsed dead time       0.000 s
 FEE module #:   5 elapsed dead time       0.000 s
 FEE module #:   6 elapsed dead time       0.000 s
 FEE module #:   7 elapsed dead time       0.000 s
 FEE module #:   8 elapsed dead time       0.000 s
 FEE module #:   9 elapsed dead time       0.000 s
 FEE module #:  10 elapsed dead time       0.000 s
 FEE module #:  11 elapsed dead time       0.000 s
 FEE module #:  12 elapsed dead time       0.000 s
 FEE module #:  13 elapsed dead time       0.000 s
 FEE module #:  14 elapsed dead time       0.000 s
 FEE module #:  15 elapsed dead time       0.000 s
 FEE module #:  16 elapsed dead time       0.000 s
 FEE module #:  17 elapsed dead time       0.000 s
 FEE module #:  18 elapsed dead time       0.000 s
 FEE module #:  19 elapsed dead time       0.000 s
 FEE module #:  20 elapsed dead time       0.000 s
 FEE module #:  21 elapsed dead time       0.000 s
 FEE module #:  22 elapsed dead time       0.000 s
 FEE module #:  23 elapsed dead time       0.000 s
 FEE module #:  24 elapsed dead time       0.000 s
 FEE module #:  25 elapsed dead time       0.000 s
 FEE module #:  26 elapsed dead time       0.000 s
 FEE module #:  27 elapsed dead time       0.000 s
 FEE module #:  28 elapsed dead time       0.000 s
 FEE module #:  29 elapsed dead time       0.000 s
 FEE module #:  30 elapsed dead time       0.000 s
 FEE module #:  31 elapsed dead time       0.000 s
 FEE module #:  32 elapsed dead time       0.000 s

 *** Program elapsed time:  341.047s (   93.829 blocks/s,   5.864 Mb/s)

Attachment 2: R1_114.out
td@localhost ~/GREAT]$ ./analyser2 /media/1E47-5D81/RIKEN/July2016/R1_114
 *** GREAT format 3.2.0 analyser - TD - May 2014
 *** ERROR: READ I/O error:       5002
                   blocks:      19133
          ADC data format:  156016542 (  554000.1 Hz)
        Other data format:     108244 (     384.4 Hz)
 Sample trace data format:          0 (       0.0 Hz)
         Undefined format:          0 (       0.0 Hz)
   Other data format type:      PAUSE:          0 (       0.0 Hz)
                               RESUME:          0 (       0.0 Hz)
                              SYNC100:     107429 (     381.5 Hz)
                           FEE64 disc:        815 (       2.9 Hz)
                             MBS info:          0 (       0.0 Hz)
                           Other info:          0 (       0.0 Hz)

   ADC data range bit set:        640 (       2.3 Hz)

                Timewarps:        ADC:          0 (       0.0 Hz)
                                PAUSE:          0 (       0.0 Hz)
                               RESUME:          0 (       0.0 Hz)
                              SYNC100:          0 (       0.0 Hz)
                           FEE64 disc:          0 (       0.0 Hz)
                             MBS info:          0 (       0.0 Hz)
                            Undefined:          0 (       0.0 Hz)
                         Sample trace:          0 (       0.0 Hz)

 Timestamp elapsed time:      281.618 s
 FEE module #:   1 elapsed dead time       0.000 s
 FEE module #:   2 elapsed dead time       0.000 s
 FEE module #:   3 elapsed dead time       0.000 s
 FEE module #:   4 elapsed dead time       0.000 s
 FEE module #:   5 elapsed dead time       0.000 s
 FEE module #:   6 elapsed dead time       0.000 s
 FEE module #:   7 elapsed dead time       0.000 s
 FEE module #:   8 elapsed dead time       0.000 s
 FEE module #:   9 elapsed dead time       0.000 s
 FEE module #:  10 elapsed dead time       0.000 s
 FEE module #:  11 elapsed dead time       0.000 s
 FEE module #:  12 elapsed dead time       0.000 s
 FEE module #:  13 elapsed dead time       0.000 s
 FEE module #:  14 elapsed dead time       0.000 s
 FEE module #:  15 elapsed dead time       0.000 s
 FEE module #:  16 elapsed dead time       0.000 s
 FEE module #:  17 elapsed dead time       0.000 s
 FEE module #:  18 elapsed dead time       0.000 s
 FEE module #:  19 elapsed dead time       0.000 s
 FEE module #:  20 elapsed dead time       0.000 s
 FEE module #:  21 elapsed dead time       0.000 s
 FEE module #:  22 elapsed dead time       0.000 s
 FEE module #:  23 elapsed dead time       0.000 s
 FEE module #:  24 elapsed dead time       0.000 s
 FEE module #:  25 elapsed dead time       0.000 s
 FEE module #:  26 elapsed dead time       0.000 s
 FEE module #:  27 elapsed dead time       0.000 s
 FEE module #:  28 elapsed dead time       0.000 s
 FEE module #:  29 elapsed dead time       0.000 s
 FEE module #:  30 elapsed dead time       0.000 s
 FEE module #:  31 elapsed dead time       0.000 s
 FEE module #:  32 elapsed dead time       0.000 s

 *** Program elapsed time:  203.500s (   94.020 blocks/s,   5.876 Mb/s)
Attachment 3: analyser2.f
      PROGRAM GREAT

C     Read and analyse GREAT format data - version 3.20
C
C     See:
C     http://npg.dl.ac.uk/documents/edoc504/edoc504.html
C     http://npg.dl.ac.uk/DataAcq/TSformat.html

      IMPLICIT NONE
C
C     Parameter variables
C
      INTEGER block_length
      PARAMETER (block_length = 16384)
C
C     Argument variables
C
      CHARACTER*255 filename
C
      INTEGER max_data_words, nowords
C
C     Local variables
C
      CHARACTER*(block_length*4) buffer
      CHARACTER*255 arg
C
      INTEGER block_number, block_ptr, data_block(block_length), event
      INTEGER header_data(6), high_water_mark, i
      INTEGER*2 i2(2), i2_2(4)
      INTEGER i4, j4, i4_2(2), io_error, tag, items( 0:31 )
      INTEGER fail, range, channel_ident, adc_data, module
      INTEGER channel, type, information, ts28, ts48, sample_length
      INTEGER information_index, mbs_data, ch, tag_old, type_old
      integer*8 i8, ts, ts_old, iargc, first_ts, last_ts, sync_old
      integer*8 pause(32), resume(32), dead(32)

      REAL dt, t1, t2, secnds

      LOGICAL verbose

      EQUIVALENCE ( data_block, buffer )
      EQUIVALENCE ( i4, i2( 1 ) )
      EQUIVALENCE ( i8, i4_2(1) )

      SAVE data_block, block_ptr, block_number, event, high_water_mark

C----67---------------------------------------------------------------72------80
C     Announce program

      WRITE( 6, 9000 )

C     Initialise variables

      block_ptr = 1
      block_number = 0
      event = 0

      ts_old = 0
      tag_old = 0
      type_old = 0
      first_ts = 0
      last_ts = 0

      verbose = .FALSE.
      
      DO i = 0, 31
       items( i ) = 0
      ENDDO

      DO i = 1, 32
       pause( i ) = -1
       resume( i ) = -1
       dead( i ) = 0
      ENDDO

C     Get input filename from command line argument

      DO i = 1, iargc()
       CALL getarg( i, arg )
       IF ( arg.EQ.'v') THEN
        verbose = .TRUE.
        write(6,*) ' verbose'
       ELSE
        filename = arg
       ENDIF
      ENDDO
      IF ( iargc().LT.1 .OR. iargc().GT.2 ) THEN
       WRITE( 6, * ) ' Usage: ./a.exe [v] <filename>'
       STOP
      ENDIF

      t1 = SECNDS( 0.0 )

C     Open file

      OPEN( 10, RECL = 4 * block_length,
     +      FILE = filename, STATUS = 'OLD', ACCESS = 'DIRECT' )

    1 CONTINUE

C     Read next block
C     Note: for FORTRAN direct I/O first block/record number is 1
C           for GREAT data format first (encoded) block/record number is 0

      block_number = block_number + 1

      READ( 10, REC = block_number, iostat=io_error, ERR=1000 ) buffer

      DO block_ptr = 7, data_block( 6 ) / 4 + 4, 2


       i4 = data_block( block_ptr )
       j4 = data_block( block_ptr + 1 )
C      IF ( block_number.EQ.1 ) THEN
C       IF ( block_ptr.LE.16 .OR. block_ptr.gt.16200 ) THEN 
C        WRITE( 6, 9500 ) block_ptr, i4, j4
C       ENDIF
C      ENDIF

       tag = AND( RSHIFT( i4, 30 ), Z'00000003' )

C      tag = 3 = ADC data format
C      tag = 2 = Other data
C      tag = 1 = Sample trace buffer format
C      tag = 0 = undefined?

C----67---------------------------------------------------------------72------80
C     ADC data

       IF ( tag.EQ.3 ) THEN

        items( 0 ) = items( 0 ) + 1

        fail = AND( RSHIFT( i4, 29 ), Z'00000001' )

        IF ( fail.NE.0 ) THEN

         WRITE( 6, 9600 ) block_number, block_ptr, i4

        ENDIF

        range = AND( RSHIFT( i4, 28 ), Z'00000001' )
        IF ( range.EQ.1 ) THEN
         items( 19 ) = items( 19 ) + 1
        ENDIF
        channel_ident = AND( RSHIFT( i4, 16 ), Z'00000fff' )
        module = AND( RSHIFT( channel_ident, 6 ), Z'0000003f' )
        channel = AND( channel_ident, Z'0000003f' )
        adc_data = AND( i4, Z'0000ffff' )
        ts28 = AND( data_block( block_ptr + 1 ), Z'0fffffff' )

        i8 = 0
        IF ( ts28.LT.Z'00000A0' ) THEN
         i4_2( 1 ) = ts48 + 1
        ELSE
         i4_2( 1 ) = ts48
        ENDIF
        ts = LSHIFT( i8, 28 )
        i8 = 0
        i4_2(1) = ts28
        ts = ts + i8

	IF ( ts.LT.ts_old ) THEN

	 items( 20 ) = items( 20 ) + 1

         WRITE( 6, 9710 ) block_number, block_ptr, i4, module, fail,
     +   range, channel_ident, channel, adc_data, ts, ts_old, tag_old,
     +   type_old
        ENDIF

        ts_old = ts
        tag_old = tag
        type_old = 0

        IF ( verbose ) THEN 
         WRITE( 6, 9700 ) block_number, block_ptr, i4, module, fail,
     +   range, channel_ident, channel, adc_data, ts
        ENDIF

        ch = channel + ( module - 1 ) * 64 + ( range * 2048 )

C----67---------------------------------------------------------------72------80
C     Other information

       ELSEIF( tag.EQ.2 ) THEN

        items( 1 ) = items( 1 ) + 1

        module = AND( RSHIFT( i4, 24 ), Z'0000003f' )
        type = AND( RSHIFT( i4, 20 ), Z'0000000f' )
        information = AND( i4, Z'000fffff' )
        ts28 = AND( data_block( block_ptr + 1 ), Z'0fffffff' )

        IF ( type.EQ.2 ) THEN

C----67---------------------------------------------------------------72------80
C     PAUSE timestamp

         items( 4 ) = items( 4 ) + 1

         ts48 = information

        i8 = 0
        IF ( ts28.LT.Z'00000A0' ) THEN
         i4_2( 1 ) = ts48 + 1
        ELSE
         i4_2( 1 ) = ts48
        ENDIF
        ts = LSHIFT( i8, 28 )
        i8 = 0
        i4_2(1) = ts28
        ts = ts + i8

         IF ( ts.LT.ts_old ) THEN

          items( 21 ) = items( 21 ) + 1

         ENDIF
         ts_old = ts
         type_old = type
         tag_old = tag
         pause( module ) = ts

         IF ( verbose ) THEN 
          WRITE( 6, 9800 ) block_number, block_ptr, i4, module, type,
     +     information, ts
         ENDIF

        ELSEIF( type.EQ.3 ) THEN

C----67---------------------------------------------------------------72------80
C     RESUME timestamp

         items( 5 ) = items( 5 ) + 1

         ts48 = information

        i8 = 0
        IF ( ts28.LT.Z'00000A0' ) THEN
         i4_2( 1 ) = ts48 + 1
        ELSE
         i4_2( 1 ) = ts48
        ENDIF
        ts = LSHIFT( i8, 28 )
        i8 = 0
        i4_2(1) = ts28
        ts = ts + i8

         IF ( ts.LT.ts_old ) THEN

          items( 22 ) = items( 22 ) + 1

         ENDIF
         ts_old = ts
         type_old = type
         tag_old = tag
         resume( module ) = ts

         IF ( resume( module ).GE.0 
     +         .AND.
     +        pause( module ).GE.0 
     +          .AND.
     +        resume( module ).GE.pause( module ) ) THEN
          dead( module ) = ( resume( module ) - pause( module ) ) 
     +                     + dead( module )
          pause( module ) = -1
          resume( module ) = -1
         ENDIF

         IF ( verbose ) THEN 
          WRITE( 6, 9810 ) block_number, block_ptr, i4, module, type,
     +     information, ts
         ENDIF

        ELSEIF( type.EQ.4 ) THEN

C----67---------------------------------------------------------------72------80
C     SYNC100 timestamp

         items( 6 ) = items( 6 ) + 1

         ts48 = information

        i8 = 0
        IF ( ts28.LT.Z'00000A0' ) THEN
         i4_2( 1 ) = ts48 + 1
        ELSE
         i4_2( 1 ) = ts48
        ENDIF
        ts = LSHIFT( i8, 28 )
        i8 = 0
        i4_2(1) = ts28
        ts = ts + i8

         IF ( first_ts.EQ.0 ) THEN
          first_ts = ts 
C          WRITE( 6, 9920 ) first_ts
C 9920     FORMAT( ' first ts: 0x', Z16.16 )
         ENDIF
... 334 more lines ...
Attachment 4: Untitled.png
Untitled.png
  325   Thu Jul 7 17:16:11 2016 Patrick Coleman-SmithInstall & Run new firmware with diagnostics for the SYNC
Connected to aidas from daresbury.
Started Firefox with windows for Rpi, Merge, Runcontrol.
Power on FEEs from Pi.
All power-up/RESET/SETUP ok.

Change Firmware from 0x17500C15 to 0x16600C1B which adds SYNC counters to the Master and Slave Timestamp logic.
Power cycle using Pi.
nnaida20 failed to start properly. Logged in as Root ok. TclHttpd Server not in place. dmesg showed no start of
XAIDA or XAIDAMEM. Instigated "reboot" by command line not power cycle. All fine after this completed.

Copied /MIDAS/TclHttpd/Html/AIDA to a backup copy.
Changed LOCAL.*, Aida.*, Check.*

System wide checks now has check SYNC counters.

Checked temperatures. Max is nnaida12 with 68.62

SYNC check :- All read 693248.
The Master counts how many SYNC pulses are issued. All Slaves count SYNC pulses received.
The Check uses a Re-Sync pulse generated without telling the Slaves a Re-Sync is due. The Master and Slaves copy
their SYNC counter to a register when the Re-SYNC occurs. These are read back and displayed.

Started the DAQ as setup with no transfer.
Good event rates range from 502Kitems/sec to 594K.

SYNC counter check : All 839680

Stop DAQ/ Enable Transfer/Merge in Pause/Go DAQ.
All the following times are DAQ local.

19:15 Good Events 414k to 484k across all 23 FEEs.

Stop DAQ/change all thresholds to 200/Go DAQ

Good Events 4.2K to 348K

19:38 Sync counter check :- 1653760 all
Temperatures Max nnaida12 69.12

20:01 Temperatures Max nnaida12 69.19
Sync counter check :- 2181120 all

20:19 Temperatures Max nnaida12 69.25
Sync counter check :- 2590720 all

21:09 Temperatures Max nnaida12 69.19
Sync counter check :- 3729408 all

21:23 Toggled Merge Pause. Doesn't SYNC.
Check statistics. SYNC rate ok, SYNC counts uneven. PAUSE range from 0 to 1158.
DAQ stop/zero statistics/restart Merge/setup and configure/Go Merge/Pause Merge/Go DAQ/Toggle Pause
21:43 Merging now at 4943691 items/sec.

Temperatures Max nnaida12 69.25

DL to RIKEN communication link failed.

23:02 Reconnect by firefox in ribfdaq.
Temperatures Max nnaida12 69.31
Merging still active.

23:20 Sync counter check :- 6722560 all
no sysc errors in system check.

23:45 Merger running 5.4M items/sec
Temperatures max nnaida12 69.44

Stop DAQ/Toggle Pause on Merger All ok.
23:51 Sync counter check 7439360 all except nnaida22 which returns -1
Retry Sync counter check 7478272 all except nnaida22 which returns -1

Log in to nnaida22, Servers look ok. AIDAExec running ok. dmesg has nothing extra.
Runcontrol shows nnaida22 as stopped.
Access nnaida22 alone and shows undefined.

Using diagnostics checked able to read registers correctly and read the sync counter register as 7478272.

Retry Sync counter check all 8006655 except nnaida22 which shows 8006388.

Check SYNC error counters and all ok except nnaida22 which has 6363 errors.

Finishing work ..... power off FEE64s using Pi, check a couple with ping from aidas1 .... fine.

Write up ELOG.

Repeat tests on Monday.
  324   Tue Jul 5 13:43:02 2016 DK, PVPulser Test Run after AIDA is moved

Needed to run startx and start the TcLHttpd server on rpi

Following along here: https://elog.ph.ed.ac.uk/AIDA/242

Can now access http://nnrpi1:8015/AIDA/Rly16/

Window at first connection reads:

Using /dev/ttyUSB0 firmware version 0

Using /dev/ttyUSB1 firmware version 0901

 

Looks good.  Sequence ALL On was selected

(By mistake, I then clicked Relay # to turn off...misread instructions...Switch all OFF, then sequence ALL on again)

7:51 they are turned on.  Waiting several minutes now.

Add 4 L of water to the chiller

8:12

 

Save/Control Settings from Hardware Control tab:

Selected: 2016Jun13-00.53.55 and Restore. (missing from elog 242)

 

From Run Control, Enable Histogramming, Data Transfer: Enable #1 All (missing from elog 242)

 

TapeServer / Merger / Httpd started on Workspace 2

 

TapeServer reports

Cannot issue more Access Tokens: Claim TapeServer returned:- 3 Capability table is full

Make a new directory for storing test data: /data10/TapeData/July2016

Merger: Setup and configure

Data Statistics shows about ~5 x 10^5 events in many nnaida.  Should we use a different setting file?

Now we try 2016Jun12-10.10.08

Perform ReSync

 

Probably event rate can be high because the SSD bias is off?  We set up Putty and turn on all biases to -100 V.

Attachments 1 and 2 show the bias settings after they are turned on.

 

Good Events rate improves, factor 2 to 4 (now about 100k roughly)

 

Pulser settings: around ~900 on dial, attenuator x10, x10, x2 switches down, 5x up.  Frequency ~250 Hz

 

21:19

Now we change the Control ASICs (select act on all ASICs, act on all FEEs)

Slow Comparator from 0x8 to 0x20

LEC/MEC Fast Comparator 0x4 to 0x80

Now the Good Events go down to about 10k to 25k roughly

 

TapeServer starts a run.

Merger started / paused / unpaused

 

Merger rate is 500k / sec.  TapeServer is 4 k / sec.

 

AIDA Syncs shown in Attachment 3

Temps in Attachment 4

Good Events Attachment 5

Run Number 1

Setting file: 2016Jul05-21.37.20 (Note: I wrote the wrong setting file name last night...this is the right one!)

 

Show sample pulser peak in Attachment 6, 7, 8

 

July 6 - 12:29: Stop the run.  114 run files, 2.0 GB each.

Attachments 9, 10 show biases at the end of the run

Attachment 1: bias1-2016-07-05_21.12.png
bias1-2016-07-05_21.12.png
Attachment 2: bias2_2016-07-05_21.12.png
bias2_2016-07-05_21.12.png
Attachment 3: aida_sync.png
aida_sync.png
Attachment 4: temps.png
temps.png
Attachment 5: good_evt.png
good_evt.png
Attachment 6: pulser_peak.png
pulser_peak.png
Attachment 7: pulse_2.15_nnaida13.png
pulse_2.15_nnaida13.png
Attachment 8: pulser-nnaida3.png
pulser-nnaida3.png
Attachment 9: bias1_run1_finish.png
bias1_run1_finish.png
Attachment 10: bias2_run1_finish.png
bias2_run1_finish.png
  323   Tue Jul 5 10:26:11 2016 DKAIDA relocation pictures

Here are some pictures after AIDA's relocation.

In principle, it can stay here, but probably after the tests we will push it further against the wall.  However, power and ethernet cabling cannot reach that far.

Attachment 1: Shows EURICA (original position) B-RIKEN moderator (near old AIDA spot) and AIDA (pushed downstream, rotated 90 degrees counterclockwise)

Attachment 2: Zoom in on AIDA from above.  Nominally the beam left downstream perspective.  Power supply is also seen on BRIKEN moderator table.

Attachment 3: Zoom in further of above, namely near FEE power supply rack

Attachment 4: More upstream view, still on beam left, showing tube with SSDs as well as water cooler in background

Attachment 5: Beam right view

Attachment 6: Beam right view from more upstream perspective, also showing the full tower of electronics.

Attachment 1: eurica-briken-aida.jpg
eurica-briken-aida.jpg
Attachment 2: beam-left-downstream.jpg
beam-left-downstream.jpg
Attachment 3: fee64-power.jpg
fee64-power.jpg
Attachment 4: beam-left-cooler.JPG
beam-left-cooler.JPG
Attachment 5: beam-right.JPG
beam-right.JPG
Attachment 6: beam-right-tower.jpg
beam-right-tower.jpg
  322   Tue Jul 5 09:39:27 2016 DKRly16 is not online

AIDA was relocated, and now I want to power it on to do some tests.  However, I cannot get the Rly16 service running.

Specifically, after powering up the entire AIDA system, on aidas1 PC, I cannot connect to the raspberry pi (nnrpi1) via a web browser

http://nnrpi1:8015/AIDA/Rly16/

The following elog may be relevant: https://elog.ph.ed.ac.uk/AIDA/58

Now I am ssh'd to nnrpi1 via aidas1

% dmesg | grep -A 3 USB0
[    8.970098] usb 1-1.3: pl2303 converter now attached to ttyUSB0
[    9.012479] usb 1-1.2.2: Detected FT232RL
[    9.178208] usb 1-1.2.2: FTDI USB Serial Device converter now attached to ttyUSB1
[    9.348288] ftdi_sio 1-1.2.3:1.0: FTDI USB Serial Device converter detected

So, I will try Patrick's workaround in elog #58

I tried it, though it's hard to know if it is done correctly.  Which item is which?  Or more importantly, which one runs Rly16?

[    8.107290] usb 1-1.2.3: USB disconnect, device number 8
[    8.261338] usbserial: USB Serial support registered for FTDI USB Serial Device
[    8.541012] ftdi_sio 1-1.2.2:1.0: FTDI USB Serial Device converter detected
[    8.712409] usb 1-1.2.2: Detected FT232RL
[    8.847955] usb 1-1.2.2: FTDI USB Serial Device converter now attached to ttyUSB0
[   12.252576] EXT4-fs (mmcblk0p2): re-mounted. Opts: (null)
[   12.728966] EXT4-fs (mmcblk0p2): re-mounted. Opts: (null)
[   23.695562] smsc95xx 1-1.1:1.0 eth0: hardware isn't capable of remote wakeup
[   25.212043] smsc95xx 1-1.1:1.0 eth0: link up, 100Mbps, full-duplex, lpa 0xC1E1
[   32.372015] Adding 102396k swap on /var/swap.  Priority:-1 extents:2 across:2134012k SSFS
[  109.219022] usb 1-1.2.3: new full-speed USB device number 9 using dwc_otg
[  109.335538] usb 1-1.2.3: New USB device found, idVendor=0403, idProduct=6001
[  109.335579] usb 1-1.2.3: New USB device strings: Mfr=1, Product=2, SerialNumber=0
[  109.335601] usb 1-1.2.3: Product: USB <-> Serial
[  109.335619] usb 1-1.2.3: Manufacturer: FTDI
[  109.354420] ftdi_sio 1-1.2.3:1.0: FTDI USB Serial Device converter detected
[  109.354761] usb 1-1.2.3: Detected FT232BM
[  109.356174] usb 1-1.2.3: FTDI USB Serial Device converter now attached to ttyUSB1
[  163.939476] usb 1-1.3: new full-speed USB device number 10 using dwc_otg
[  164.042348] usb 1-1.3: New USB device found, idVendor=067b, idProduct=2303
[  164.042424] usb 1-1.3: New USB device strings: Mfr=1, Product=2, SerialNumber=0
[  164.042446] usb 1-1.3: Product: USB-Serial Controller D
[  164.042468] usb 1-1.3: Manufacturer: Prolific Technology Inc.
[  164.096379] usbcore: registered new interface driver pl2303
[  164.098786] usbserial: USB Serial support registered for pl2303
[  164.099085] pl2303 1-1.3:1.0: pl2303 converter detected
[  164.106666] usb 1-1.3: pl2303 converter now attached to ttyUSB2

On ttyUSB# have:
FT232RL
FT232BM
pl2303


% lsusb
Bus 001 Device 002: ID 0424:9514 Standard Microsystems Corp.
Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
Bus 001 Device 003: ID 0424:ec00 Standard Microsystems Corp.
Bus 001 Device 004: ID 05e3:0608 Genesys Logic, Inc. USB-2.0 4-Port HUB
Bus 001 Device 010: ID 067b:2303 Prolific Technology, Inc. PL2303 Serial Port
Bus 001 Device 005: ID 046d:c016 Logitech, Inc. Optical Wheel Mouse
Bus 001 Device 007: ID 04f2:0402 Chicony Electronics Co., Ltd Genius LuxeMate i200 Keyboard
Bus 001 Device 006: ID 0403:6001 Future Technology Devices International, Ltd FT232 USB-Serial (UART) IC
Bus 001 Device 009: ID 0403:6001 Future Technology Devices International, Ltd FT232 USB-Serial (UART) IC

N.B. USB-serial link to FEE64 is not connected to anything on the serial side.  What is the FEE64 Console?  See Attachment 1.  It goes to nothing!  Should it go somewhere?

Attachment 1: usb_serial.JPG
usb_serial.JPG
  321   Sat Jun 18 14:35:31 2016 TDAIDA photographs - June 2016 - 4
Attachment 1: IMG_5854.JPG
IMG_5854.JPG
Attachment 2: IMG_5855.JPG
IMG_5855.JPG
Attachment 3: IMG_5856.JPG
IMG_5856.JPG
Attachment 4: IMG_5857.JPG
IMG_5857.JPG
Attachment 5: IMG_5863.JPG
IMG_5863.JPG
  320   Thu Jun 16 08:18:50 2016 TDAIDA photographs - May 2016
Attachment 1: IMG_5751.JPG
IMG_5751.JPG
Attachment 2: IMG_5721.JPG
IMG_5721.JPG
  319   Thu Jun 16 08:12:51 2016 TDAIDA photographs - June 2016 - 3
Attachment 1: IMG_5849.JPG
IMG_5849.JPG
Attachment 2: IMG_5850.JPG
IMG_5850.JPG
Attachment 3: IMG_5851.JPG
IMG_5851.JPG
ELOG V3.1.4-unknown