2004 CDF E-Log -- Owl shift. Wed Mar 3, 2004
SciCo       DAQ Ace   Monitoring Ace  CO      (Operations Manager)
S.Miscetti  A.Ivanov  S.Sabik         B.Mohr  M.Convery


Start of Shift Notes:  

Shot setup for store # 3271 in progress 
COT to be kept OFF, run JET_ST3_DECOUPLED[1,433,403]

Wed Mar 3 00:05:44 Run 179574 TERMINATE: junk junk - Andrew x2080
Wed Mar 3 00:14:08 Run 179575 TERMINATE: COT problems - Andrew x2080
Wed Mar 3 00:18:17
Antiprotons have been loaded.
Ip = 9400E9 
Ipbar = 980E9 
 - S.Miscetti
Wed Mar 3 00:19:57 TOF heart beat. SMACS hung. Killed it. Restarted it. - Simon
Wed Mar 3 00:20:59 Run 179576 ACTIVATE: JET_ST3_DECOUPLED [1,433,403] shot setup COT crates are out - Andrew x2080
Wed Mar 3 00:34:48 Scraping done Ip=8700E9, IPb=935E9 Lost P = 7KHz Lost Pbar=480 Hz  - S.Miscetti
Wed Mar 3 00:37:11
set prescale bit 1 (L1_JET3_PS1) to 10.  
It significantly reduced dead time.
 - Andrew :: (run 179576)
Wed Mar 3 00:37:47 CLC is up. Initial Luminosity : 5.4E31 MCR has been informed. - S.Miscetti
Wed Mar 3 00:38:32 Run 179576 TERMINATE: turning ON high voltages - Andrew x2080
Wed Mar 3 00:45:32 Run 179577 ACTIVATE: shot setup DAQ test JET_ST_2_DECOUPLED [1,433,403] COT crates are out all HV are ON except for COT and TOF - Andrew x2080
Wed Mar 3 00:56:36
COT Gas Alarm. Cryo tech is taking care of it.
 - S.Miscetti
Wed Mar 3 00:58:21 TOF HV failed to ramp up. PC said: "could not communicate with device" or something like that. But there was no heart beat alarm on IFIX. Restarted SMACS and it gave the same error message. Restarted TOF PC and it seems to be running fine now. - Simon
-- Wed Mar 3 01:07:33 comment by...Simon --  
IFIX can't communicate with TOF. The TOF PC seems to be running fine. But the TOF alarm on IFIX
has been gray for more than 10 minutes.

-- Wed Mar 3 01:18:35 comment by...Simon --  
Connection still not established with node.

Wed Mar 3 00:58:34
Note off-diagonal entries in Trig-Mon L1 Cal. Paged ADMEM expert. Mark Mattson replied -- he thinks it is a monitoring issue.
 - Brian
-- Wed Mar 3 01:29:47 comment by...m mattson --  
First of all, I'd like to commend whoever set up the "recent consumer plots" on the CO Help page. The Sci-Co directed me there, and pointed out the plots they were looking at.

I'm not sure what is up with these plots. If you look at this one (1.3.3) and the L1 Dirac Occupancy (1.3.2), it implies that there are absolutely *no* EM entries. The plot I usually look at is the 125 MeV DCAS Occupancy (1.3.4), which looks good (normal EM occupancy).

The Sci-Co asked if this might be related to what Bill B did earlier to recover ccal05. If so, I would expect problems in 3 phi on the west side, and not for the entire central calorimeter.

The next question was, should anyone else be consulted? As it is 1 AM, and there are no other "symptoms" to indicate a problem, I suggested it is enough to make a note in the e-log. Experts can look into this tomorrow.
-- Wed Mar 3 01:30:57 comment by...S.Miscetti --  

The only two people working on the whole Central were Fotis and
B.Badgett. From the above plot it looks like only the EM is having
a problem with the trigger, while the Display also shows a multiplicity of small hits on CHA. Fotis
was working with the Laser, and the only thing that could affect all of the Central is LER downloading.
We will try to get in touch with both of them.


-- Wed Mar 3 01:33:09 comment by...rainer --  person to be commended for web consumer plots is Charles Plager. I agree he did a nice job indeed.
-- Wed Mar 3 01:43:03 comment by...m mattson --  
While I concede that it could be LERs or some database mismatch (I don't know what actions were taken earlier), I'm leaning towards monitoring/software. I don't understand how you can have zero EM occupancy at L1, but normal EM occupancy at L2.
-- Wed Mar 3 01:59:27 comment by...S.Miscetti --  
I talked to Fotis and I correct the above statement. Fotis did not
touch the LER settings; this is not required for Laser Cal!
LEDs are working properly, as checked by the HATD plot for CHA.

-- Wed Mar 3 09:42:24 comment by...carla --  
This problem has been noted a number of times already.
There is an option to control the hardware spike killer for CEM. The way this is controlled changed
recently, as per JDL's request. The code to tell the trigger simulation
whether the spike killer is enabled or not exists, and it is finding
its way to Trigmon.
If the errors appear in a run with a TEST trigger cable,
they are not real.

Wed Mar 3 00:58:37
circular pattern in calo - qcd instanton vortex bubble ?
 - burkard,rainer
Wed Mar 3 01:04:30 current L1 trigger rates after prescale (Fred Prescaled)

 
L1_JET3_PS1[1]   / 10    12375 Hz 
L1_MB_CLCPS50K[1]/50000     25 Hz 
L1_MB_XING_PS1M[2]/1000003   1.6 Hz 
L1_JET3_PS1M[1]   /1000003   0.2 Hz 

Total L1 12240 Hz 
      L2   102 Hz 
 - rainer :: (run 179577)
-- Wed Mar 3 01:05:44 comment by...rainer --  still determining whether calo is correct and rates can be taken seriously ...
-- Wed Mar 3 01:10:35 comment by...Rainer --  L1_JET2_PS1[1] /6 = 16.830 Hz. Total L1=20087 Hz.
Wed Mar 3 01:19:14
ObjectMon has no data.  Plots are not filling with events.  I have killed the Mon and
restarted it, but this did not help. I am trying again.
 - Brian
Wed Mar 3 01:30:52
 - Simon
-- Wed Mar 3 01:31:18 comment by...Simon --  Losses and SVRAD during shot
Wed Mar 3 01:40:23
 - Simon
Wed Mar 3 01:40:46
TrigMon L1 Sum Et at Stefano's request
 - Brian
Wed Mar 3 01:42:27
LOSTP spike
 - Simon
Wed Mar 3 02:05:47 Run 179577 TERMINATE: ending the run to include silicon for Mario Martinez Run. - Andrew x2080
Wed Mar 3 02:10:34 Run 179578 Activated at 2004.03.03 02:09:52 - RunControl
Wed Mar 3 02:13:30 Run 179578 ACTIVATE: JET_ST3_DECOUPLED[1,433,403] COT crates dropped, TOF inhibit masked initial prescale L1_JET3_PS1[1] /10 - Andrew x2080
-- Wed Mar 3 02:17:03 comment by...Rainer --   Setting is now
L1_JET3_PS1[1]   / 5       20900 Hz 
L1_MB_CLCPS50K[1]/50000       22 Hz 
L1_MB_XING_PS1M[2]/1000003   1.8 Hz 
L1_JET3_PS1M[1]   /1000003   0.1 Hz 

Total L1 21000 Hz 
      L2   100 Hz 
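
The prescale arithmetic behind the two L1_JET3_PS1 settings quoted in this log (/10 at 12375 Hz, /5 at 20900 Hz) can be sketched as follows. This is only an illustration under the assumption of an ideal prescaler (accepted rate = input rate / prescale factor) with dead time ignored; the function names are hypothetical and are not part of any CDF tool.

```python
import math


def implied_input_rate(accepted_rate_hz, prescale):
    """Input rate before prescaling, assuming an ideal prescaler."""
    return accepted_rate_hz * prescale


def accepted_rate(input_rate_hz, prescale):
    """Rate after prescaling, assuming an ideal prescaler."""
    return input_rate_hz / prescale


def min_prescale_for_limit(input_rate_hz, limit_hz):
    """Smallest integer prescale that keeps the accepted rate at or below limit_hz."""
    return math.ceil(input_rate_hz / limit_hz)


# Values copied from the two rate tables in this log.
rate_at_ps10 = implied_input_rate(12375.0, 10)   # run 179577, prescale /10
rate_at_ps5 = implied_input_rate(20900.0, 5)     # run 179578, prescale /5

print("implied input rate at /10:", rate_at_ps10, "Hz")
print("implied input rate at /5: ", rate_at_ps5, "Hz")
print("prescale needed for <= 20 kHz:", min_prescale_for_limit(rate_at_ps5, 20000.0))
```

The two implied input rates differ, which is plausible since the luminosity (and possibly dead time) changed between the two readings; the sketch does not attempt to model that.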

Wed Mar 3 02:29:11
TOF was grey in the iFix Alarm window. 
TOF PC downstairs reported: ' SUMMARY LIST is overflowed' 
I restarted the PC and went through all procedures to start SMACS, 
but once I hit the 'play' button, I get the same error: 
'SUMMARY LIST is overflowed' 
TOF is back green now, however the heart beat remains grey.  
We will take no action on this problem until the morning.
 - Andrew
-- Wed Mar 3 02:42:19 comment by...Andrew --  
TOF heart beat is back green. Everything is fine. We leave TOF
off for this run.

Wed Mar 3 02:40:05
Please note:  Rainer and I manually switched SVXMon
to the cosmic setup for the tests tonight, because the data is not in stream A.  At the moment all
other Mons are running in physics mode.  SVXMon will need to be switched back to physics mode when
we begin taking physics data.  (Just source the script to put all consumers in physics
mode.)
 - Brian
Wed Mar 3 02:49:40
CER_SVXMON_HALT_RECOVER_RUN_ERROR 

b0dap84.fnal.gov:ConsumerErrorRe:2:37:01 AM->Runtime Error 2, Event 301, RunNum 179578: SvxMon Halt Recover Run: Pipeline out of synch in 135 silicon readout chips.

SCPU_BAD_CHANNEL_COUNTS Error 

Hardware EVB has detected a problem with data quality in 
 SCPU b0eb23. 
b0l3pcom1.fnal.gov:main:2:44:48 AM->Host b0eb23.fnal.gov, task tRec_0 
SCPU-P1-E-BadChannelCounts: In event 207985 the sum of channel counts is inconsistent with the total count, VRB slot 20. 
b0l3pcom2.fnal.gov:main:2:44:49 AM->Error on L3 node b0l3098 (partition 1) 
 Wed Mar 3 02:44:48 2004 l3_node 
  in refoInt_reformatProc (l3_refevt.c:610) 
     ( L3_REFORMAT_ERROR: see Error Handler _reformatter.log file for full message) 

In both cases HRR worked.
 - Andrew :: (run 179578)
Wed Mar 3 02:52:42 Silicon Status for ST3 test

Adjusted the prescale of the ST3 trigger to be below 20 kHz. Restarted SVXMon in cosmics mode since we are logging into stream I. Several glinks seemed to be unsynch'ed in ISL, but HRR cured it. Now seems to be taking data fine and safely.  - Rainer :: (run 179578)
-- Wed Mar 3 02:55:19 comment by...rainer --  see silicon elog


Wed Mar 3 03:05:30
 - Simon
Wed Mar 3 03:06:27
Please Note:  I also switched ObjectMon to run in
cosmic mode.  Kaori confirmed that both ObjectMon and SVXMon read only stream A in physics mode.  We
do not have stream A in the current configuration.
 - Brian
-- Wed Mar 3 03:15:46 comment by...kaori --  Please note that even though ObjectMon now gets events because it is getting 'any' triggers, we will probably find that many of the ObjectMon plots are 'red'. This is because the reference plots for ObjectMon are now set to 'cosmic'. If this is a real problem for the shift crew, please call me again.
-- Wed Mar 3 03:45:30 comment by...Brian --  
While speaking with Kaori, I created a new .tcl file for ObjectMon when running with the jet
trigger.  It has the physics reference plots but picks up all data streams.  Note that the jet
trigger data does not quite match the references, but it is within what we expect.


The .tcl file is "ObjectMon_jet.tcl" and is located with the other ObjectMon .tcl files.  This file
will need to be set manually when running in this mode again.

Wed Mar 3 04:53:04 PSM HV alarm. Recovered by itself. - Simon :: (run 179578)
Wed Mar 3 05:06:23
 - Simon
Wed Mar 3 07:14:15
 - Simon
Wed Mar 3 07:58:02
There were lots of persistent errors during last few hours: 

L1Mon: saw 210 L1 DMA transfers, expect 1 (buffer number 1) 

All of them were recovered by auto-HRR.
 - Andrew :: (run 179578)
Wed Mar 3 08:01:38
Shift started with shot setup in progress for store #3271 
Plan was to take data with JET_ST3_DECOUPLED[1,433,403]  
with Silicon and then test TEST_MET15_DECOUPLED[1,434,403].  
COT has to be OFF. 

Scraping done at 0:35 am.  
Ip/pbar = 8700E9 / 935E9, Lostp/Lostpbar = 7 / 0.5 kHz 
Initial Luminosity L = 5.4E31 

Gas alarm reset by Cryo. 

Test of JET_ST3_DECOUPLED[1,433,403] without COT crates,  
Silicon IN. 
Initial prescale L1_JET3_PS1[1]/10. RUN 179578.  
The prescales have been fixed to have a rate < 20 kHz  
(L1 = 18 kHz, L2 = 100 Hz) 
Rainer fixed several glinks unsynch'ed in ISL. 

Still it is not yet clear if this is a valid test.  
Trigmon shows a not fully diagonal correlation  
between DCAS trigger tower ET and ADMEM Et. 
This is only in the central EM. Similarly(?) the  
display shows anomalous hits on the CEM, CHA.  
This has to be investigated more carefully today. 

Consumer processes were also having some problems  
since the data do not go on stream A:  
SVXMON and OBJECTMON run now in cosmic mode.  
Check Kaori comments on the e-log. 
 - S.Miscetti :: (run 179578)
-- Wed Mar 3 10:45:06 comment by...Burkard --  At the beginning of the shift the Pulsar guys were still working on the fiber splitters; details can be found in the Pulsar elog.