|
2004 CDF E-Log -- Owl shift. Wed Mar 3, 2004 |
| SciCo |
DAQ Ace |
Monitoring Ace |
CO |
(Operations Manager) |
| S.Miscetti |
A.Ivanov |
S.Sabik |
B.Mohr |
M.Convery |
Start of Shift Notes:  Shot setup for store # 3271 in progress
COT to be kept OFF, run JET_ST3_DECOUPLED[1,433,403]
Wed Mar 3 00:05:44
Run 179574
TERMINATE: junk junk - Andrew x2080
Wed Mar 3 00:14:08
Run 179575
TERMINATE: COT problems - Andrew x2080
Wed Mar 3 00:18:17
AntiProton have been loaded.
Ip = 9400E9
Ipbar = 980E9
- S.Miscetti
Wed Mar 3 00:19:57
TOF heart beat. SMACS hung. Killed it. Restarted it. - Simon
Wed Mar 3 00:20:59
Run 179576
ACTIVATE: JET_ST3_DECOUPLED [1,433,403]
shot setup
COT crates are out - Andrew x2080
Wed Mar 3 00:34:48
Scraping done
Ip=8700E9, IPb=935E9
Lost P = 7KHz
Lost Pbar=480 Hz - S.Miscetti
Wed Mar 3 00:37:11
set prescale bit 1 (L1_JET3_PS1) to 10.
It significantly reduced dead time.
- Andrew :: (run 179576)
Wed Mar 3 00:37:47
CLC is up.
Initial Luminosity : 5.4E31
MCR has been informed. - S.Miscetti
Wed Mar 3 00:38:32
Run 179576
TERMINATE: turning ON high voltages - Andrew x2080
Wed Mar 3 00:45:32
Run 179577
ACTIVATE: shot setup DAQ test
JET_ST_2_DECOUPLED [1,433,403]
COT crates are out
all HV are ON excpet for COT and TOF - Andrew x2080
Wed Mar 3 00:56:36
COT Gas Alarm. Cryo tech is taking care of it.
- S.Miscetti
Wed Mar 3 00:58:21
TOF HV failed to ramp up. PC said: "could not communicate with device" or something like that. But there was no heart beat alarm on IFIX. Restarted SMACS and it said the same error message. Restarted TOF PC and it seems to be running fine now. - Simon
-- Wed Mar 3 01:07:33 comment by...Simon -- IFIX can't communicate with TOF. The TOF PC seems to be running fine. But the TOF alarm on IFIX
has been gray for more than 10 minutes.
-- Wed Mar 3 01:18:35 comment by...Simon -- Connection still not established with node.
Wed Mar 3 00:58:34
 | Note off-diagonal entries in Trig-Mon L1 Cal.
Paged ADMEM expert. Mark Mattson replied -- he thinks it is a monitoring issue. |
- Brian
-- Wed Mar 3 01:29:47 comment by...m mattson --
First of all, I'd like to commend whoever set up the "recent consumer plots" on the CO Help page. The Sci-Co directed me there, and pointed out the plots they were looking at.
I'm not sure what is up with these plots. If you look at this one (1.3.3) and the L1 Dirac Occupancy (1.3.2), it implies that there are absolutely *no* EM entries. The plot I usually look at is the 125 MeV DCAS Occupancy (1.3.4), which looks good (normal EM occupancy).
The Sci-Co asked if this might be related to what Bill B did earlier to recover ccal05. If so, I would expect problems in 3 phi on the west side, and not for the entire central calorimeter.
The next question was, should anyone else be consulted? As it is 1 AM, and there are no other "symptoms" to indicate a problem, I suggested it is enough to make a note in the e-log. Experts can look into this tomorrow.
-- Wed Mar 3 01:30:57 comment by...S.Miscetti --
The only two persons working on the whole Central were Fotis and
B.Badgett. From the above plot looks that only the EM is having
problem with the trigger, while the Display shows also a multiplicity of small hits on CHA. Fotis
was working with the Laser and the only reason that could affect all the Central is LER downloading.
We will try to get in touch with both of them.
-- Wed Mar 3 01:33:09 comment by...rainer -- person to be commended for web consumer plots is Charles Plager. I agree he did a nice job indeed.
-- Wed Mar 3 01:43:03 comment by...m mattson --
While I conceed that it could be LERs or some database mismatch. (I don't know what actions were taken earlier.) I'm leaning towards monitoring/software. I don't understand how you can have zero EM occupancy at L1, but normal EM occupancy at L2.
-- Wed Mar 3 01:59:27 comment by...S,Miscetti -- I talk to Fotis and I correc the above statement. Fotis did not
touch the LER setting; this is not required for Laser Cal!
LED are working properly has checked by HATD plot for CHA.
-- Wed Mar 3 09:42:24 comment by...carla -- This problem has been noted a number of times already.
There is an option to control the hardware spike killer for CEM . The way this is controlled changed
recently
as for JDL request. The code to tell the trigger simulation
if the spike killer is enable or not exists and it's finding
its way to Trigmon.
If the errors appear in a run with a TEST trigger cable you
they are not real.
Wed Mar 3 00:58:37
 | circular pattern in calo - qcd instanton vortex bubble ? |
- burkard,rainer
Wed Mar 3 01:04:30
current L1 trigger rates after prescale (Fred Prescaled)
L1_JET3_PS1[1] / 10 12375 Hz
L1_MB_CLCPS50K[1]/50000 25 Hz
L1_MB_XING_PS1M[2]/1000003 1.6 Hz
L1_JET3_PS1M[1] /1000003 0.2 Hz
Total L1 12240 Hz
L2 102 Hz
- rainer :: (run 179577)
-- Wed Mar 3 01:05:44 comment by...rainer -- still determining whether calo is correct and rates can be taken serious ...
-- Wed Mar 3 01:10:35 comment by...Rainer -- L1_JET2_PS1[1] /6 = 16.830 Hz.
Total L1=20087 Hz.
Wed Mar 3 01:19:14
ObjectMon has no data. Plots are not filling with events. I have killed the Mon and
restarted, but this did help. I am trying again.
- Brian
Wed Mar 3 01:30:52

- Simon
-- Wed Mar 3 01:31:18 comment by...Simon -- Losses and SVRAD during shot
Wed Mar 3 01:40:23



- Simon
Wed Mar 3 01:40:46
 | TrigMon L1 Sum Et at Stefano's request |
- Brian
Wed Mar 3 01:42:27
 | LOSTP spike |
- Simon
Wed Mar 3 02:05:47
Run 179577
TERMINATE: ending to run to include silicon for Mario Martinez Run. - Andrew x2080
Wed Mar 3 02:10:34
Run 179578
Activated at 2004.03.03 02:09:52 - RunControl
Wed Mar 3 02:13:30
Run 179578
ACTIVATE: JET_ST3_DECOUPLED[1,433,403] COT crates dropped, TOF inhibit masked
initial prescale L1_JET3_PS1[1] /10 - Andrew x2080
-- Wed Mar 3 02:17:03 comment by...Rainer -- Setting is now
L1_JET3_PS1[1] / 5 20900 Hz
L1_MB_CLCPS50K[1]/50000 22 Hz
L1_MB_XING_PS1M[2]/1000003 1.8 Hz
L1_JET3_PS1M[1] /1000003 0.1 Hz
Total L1 21000 Hz
L2 100 Hz
Wed Mar 3 02:29:11
TOF was grey in the iFix Alarm window.
TOF PC downstairs reported: ' SUMMARY LIST is overflowed'
I restarted PC and went through all procedures to start SMACS
but once I hit 'play' button. I get the same error:
'SUMMARY LIST is overflowed'
TOF is back green now, however heart bit remains grey.
We take no action about this problem till the morning.
- Andrew
-- Wed Mar 3 02:42:19 comment by...Andrew -- TOF heart bit is back green. Everrything is fine. We leave TOF
off for this run.
Wed Mar 3 02:40:05
| Please note: Rainer and I manually switched SVXMon
to the cosmic setup for the tests tonight, because the data in not in stream A. At the moment all
other Mons are running in physics mode. SVXMon will need to be switched back to physics mode when
we begin taking physics data. (Just source the script to put all consumers in physics
mode.) |
- Brian
Wed Mar 3 02:49:40
CER_SVXMON_HALT_RECOVER_RUN_ERROR
b0dap84.fnal.gov:ConsumerErrorRe:2:37:01 AM->Runtime Error 2, Event 301, R
unNum 179578: SvxMon Halt Recover Run: Pipeline out of synch in 135 silicon read
out chips .
SCPU_BAD_CHANNEL_COUNTS Error
Hardware EVB has detected a problem with data quality in
SCPU b0eb23.
b0l3pcom1.fnal.gov:main:2:44:48 AM->Host b0eb23.fnal.gov, task tRec_0
SCPU-P1-E-BadChannelCounts: In event 207985 the sum of channel counts is inconsi
stent with the total count, VRB slot 20.
b0l3pcom2.fnal.gov:main:2:44:49 AM->Error on L3 node b0l3098 (partition 1)
Wed Mar 3 02:44:48 2004 l3_node
in refoInt_reformatProc (l3_refevt.c:610)
( L3_REFORMAT_ERROR: see Error Handler _reformatter.log file for full message)
In both cases HRR worked. - Andrew :: (run 179578)
Wed Mar 3 02:52:42
Silicon Status for ST3 test
adjusted prescale to ST3 trigger to be below 20 kHz. re-started SVXMon in cosmics mode since we are logging into stream I. several glinks seemed to be unsynch'ed in ISL, but HRR cured it. now seems to be taken data fine and safely. - Rainer :: (run 179578)
-- Wed Mar 3 02:55:19 comment by...rainer -- see silicon
elog
Wed Mar 3 03:05:30



- Simon
Wed Mar 3 03:06:27
| Please Note: I also switched ObjectMon to run in
cosmic mode. Kaori confirmed that both ObjectMon and SVXMon read only stream A in physics mode. We
do not have stream A in the current configuration. |
- Brian
-- Wed Mar 3 03:15:46 comment by...kaori -- Please note that even though ObjectMon now gets events because
it is getting 'any' triggers, we probably find that many of the
ObjectMon plots would be 'red'. It is because reference plots
for ObjectMon is now set to 'cosmic'. If this is real problem
for the shift crew, please call me again.
-- Wed Mar 3 03:45:30 comment by...Brian -- While speaking with Kaori, I created a new .tcl file for ObjectMon when running with the jet
trigger. It has the physics reference plots but picks up all data streams. Note that the jet
trigger data does not quite match the references, but it is within what we expect.
The .tcl file is "ObjectMon_jet.tcl" and is located with the other ObjectMon .tcl files. This file
will need to be set manually when running in this mode again.
Wed Mar 3 04:53:04
PSM HV alarm. Recovered by itself. - Simon :: (run 179578)
Wed Mar 3 05:06:23



- Simon
Wed Mar 3 07:14:15



- Simon
Wed Mar 3 07:58:02
There were lots of persistent errors during last few hours:
L1Mon: saw 210 L1 DMA transfers, expect 1 (buffer number 1)
All of them were auto-HRR receoverd.
- Andrew :: (run 179578)
Wed Mar 3 08:01:38
Shift started with shot setup in progress for run#3271
Plan was to take data with JET_ST3_DECOUPLED[1,433,403]
with Silicon and then test TEST_MET15_DECOUPLED[1,434,403].
COT has to be OFF.
Scraping done at 0:35 am.
Ip/pbar = 8700E9 / 935E9, Lostp/Lostpbar = 7 / 0.5 Khz
Initial Luminosity L = 5.4E31
Gas alarm reset by Cryo.
Test of JET_ST3_DECOUPLED[1,433,403] without COT crates,
Silicon IN.
Initial prescale L1_JET3_PS1[1]/10. RUN 179578.
The prescales have been fixed to have a rate < 20 KHz
(L1= 18KHz, L2=100Hz)
Rainer fixed several glinks unsynch'ed in ISL.
Still it is not yet clear if this is a valid test.
Trigmon shows a not fully diagonal correlation
between DCAS trigger tower ET and ADMEM Et.
This is only in the central EM. Similarly(?) the
display shows anomalous hits on the CEM, CHA.
This has to be investigated more carefully today.
Consumer processes were also having some problems
since the data do not go on stream A:
SVXMON and OBJECTMON run now in cosmic mode.
Check Kaori comments on the e-log.
- S.Miscetti :: (run 179578)
-- Wed Mar 3 10:45:06 comment by...Burkard -- At the beginning of the shift Pulsar Guys were still working on the fibers splitters, details about that can be found in
the pulsar elog