2004 CDF E-Log -- Owl shift. Fri Mar 5, 2004
SciCo        DAQ Ace          Monitoring Ace    CO        (Operations Manager)
S.Miscetti   N.Miladinovic    V.Khotilovich     B.Mohr    M.Convery


Start of Shift Notes:  

Store # 3273 in progress. Lum=1.7E31 

Fri Mar 5 00:52:03 We lost the TOF heartbeat and I restarted the Smacs using the preset settings, as the instructions tell us to do during data taking. The Smacs started fine but we had a TOF HV trip. We paged the expert and he advised restarting the Smacs without preset parameters (as we should do when we are not taking data). I turned the TOF HV on and it came back after some time. - Vadim :: (run 179644)
Fri Mar 5 00:53:27 There was a short COT temperature problem which cleared itself really fast - Vadim :: (run 179644)
Fri Mar 5 01:08:13
 - Vadim
-- Fri Mar 5 01:09:07 comment by...Vadim --  Hourly plots. Abort gaps began their way up.
Fri Mar 5 01:57:52
 - Vadim
-- Fri Mar 5 02:03:44 comment by...Vadim --  
These are I_AVDD_B5W1L2 and I_AVDD_B4W3L2 cells - they stay pink after unmarking and HRRing.
The second one has been like that for a long time, but the first one is new. Will page silicon.


There were two more pink cells in SVX: S_AVDD_B2W7L4 and S_AVDD_B4W8L3 but they are doing well after
unmarking

-- Fri Mar 5 02:18:57 comment by...Vadim --  Typo: I_AVDD_B5W1L2 should be I_AVDD_B5W1L1. Also, IMON crashed when I was closing a history window. I restarted it.
-- Fri Mar 5 02:57:54 comment by...Brian --  
I spoke with silicon expert.  No SVXMon cells are yellow, so for the time being, we will just
continue monitoring.

Fri Mar 5 02:26:45 About 20' ago we had a COT "sound" alarm related to the flow rate on COTBUBSUP (low lim 13.00 scfh, now 12.97), immediately silenced by the Cryo tech. I called Cryo, who informed us that they are taking care of it.  - S.Miscetti
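The alarm condition in the entry above can be illustrated with a short sketch. It assumes the alarm fires when the reading is strictly below the low limit; the function name is hypothetical and is not part of any CDF alarm software. The 12.0 limit is the lowered value reported later in this shift.

```python
def below_low_limit(reading: float, low_limit: float) -> bool:
    """Return True when a reading is under its low alarm limit.

    Assumption: the alarm uses a strict "less than" comparison.
    """
    return reading < low_limit


reading = 12.97  # COTBUBSUP flow rate quoted in the entry above

# Limit in force when the alarm sounded.
print(below_low_limit(reading, 13.00))
# Limit after it was lowered later in the shift.
print(below_low_limit(reading, 12.00))
```

With the original limit the reading is in alarm; with the lowered limit it is not.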
Fri Mar 5 02:35:05
 - Hourly pics. Vadim
Fri Mar 5 03:13:48
 - Vadim
Fri Mar 5 03:26:21 Run 179644 ACTIVE:
Got the following error:
 Attention !!!. CER_SVXMON_HALT_RECOVER_RUN_ERROR !!! 
 Stuck Cellid S/B1/W5/L4/C7-13 . 
HRR worked.
 - natasha x2080
Fri Mar 5 04:01:28
HVAC alarm. Cryo is investigating: 
EF-2 (EF-2 Flow) should be > 400 CFM; now we have 375. 
It seems to be related to the storm passing through! 
I told the cryo tech to keep us informed. 
 - S.Miscetti
Fri Mar 5 04:04:05
 - Brian
-- Fri Mar 5 04:06:58 comment by...Brian --  
There are a number of consecutive hot channels in CES East raw ADC counts, wedge 21.  The
number of hot channels (~14) is less than the maximum allowed for a good run (32) in the CO checklist.  I
will continue to monitor the situation.


Also, low band in wedge 20 & 21 in CPR East.

-- Fri Mar 5 07:41:39 comment by...Larry Nodulman --  Counting is a much better diagnostic than average ph, which is probably highlighting some coherent noise. The color plots are quite variable about highlighting differences.
Fri Mar 5 04:10:23
 - (H.P.) Vadim
Fri Mar 5 04:16:02
History_I_AVDD_B5W0L0 got pink, and unmarking and HRRing don't help. SVXMon doesn't seem to be complaining... I guess we need to page again
 - Vadim
-- Fri Mar 5 04:23:54 comment by...Vadim --  will keep an eye on it to see if it will increase
-- Fri Mar 5 04:35:56 comment by...Vadim --  Ok, after the fourth attempt to unmark, it has stayed yellow for 10 min already!
Fri Mar 5 05:07:38
 - (H.P.) Vadim
Fri Mar 5 05:32:36
While I was talking with the Cryo tech, who reported that Bill 
Noe has lowered the alarm limit for COT COTBUBSUP (LL from 13 
to 12 scfm), the Ace smelled something bad and I went upstairs 
to check the L3 room. I found some water dripping from the 
ceiling. The Cryo tech put up some temporary plastic and is notifying 
the building manager.
 - S.Miscetti
Fri Mar 5 05:49:46
We went upstairs again. The temperature is rising. 
We have 28 degrees in the L3 room and Transformer b0-14ABC 
looks really warm. The building manager is on the way.
 - S.Miscetti
-- Fri Mar 5 06:01:11 comment by...S.Miscetti --  
Under the direction of the OPmanager we also paged the L3 and DAQ
people just to inform them of the situation. Derwin Allen and
Steve Hahn were also informed. Fortunately the Building Manager
arrived and convinced us that 82F is the "normal" temperature.
Hope everything goes fine.
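
For reference, the 28 degrees read in the L3 room and the 82F quoted by the Building Manager agree if the first reading is in Celsius. That unit is an assumption, since the entry does not state it. A minimal sketch of the conversion:

```python
def celsius_to_fahrenheit(temp_c: float) -> float:
    """Convert a temperature from degrees Celsius to degrees Fahrenheit."""
    return temp_c * 9.0 / 5.0 + 32.0


# 28 is the L3 room reading from the log, assumed here to be in Celsius.
l3_room_f = celsius_to_fahrenheit(28.0)
print(f"{l3_room_f:.1f} F")
```

Under that assumption the reading comes out at about 82.4 F.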

Fri Mar 5 06:07:29
 - (H.P.) Vadim
Fri Mar 5 06:24:16 Run 179644 ACTIVE:
Attention !!!. CER_SVXMON_HALT_RECOVER_RUN_ERROR !!! 
 Stuck Cellid S/B1/W5/L4/C7-13 . 
 AUTO HRR worked.
 - natasha x2080
Fri Mar 5 07:27:49
 - (Hr.Pl.) Vadim
Fri Mar 5 07:44:07
At the previous CO's request, I am writing this note to introduce DQMon -- the software to
monitor the consumers that may eventually replace the CO.  DQMon has been running since Diego's
shift yesterday in a trial/test period.  According to Diego, it crashed once during his shift.  I
noticed a few read errors and other minor things, but the software ran all night.
 - Brian
Fri Mar 5 07:53:34 Run 179644 ACTIVE: CER_SVXMON_HALT_RECOVER_RUN_ERROR: Stuck Cellid S/B1/W5/L4/C7-13 . AUTO HRR will be issued - it helped - natasha x2080
Fri Mar 5 07:55:26
Run Number Data Type Physics Table Begin Time End Time Live Time L1 Accepts L2 Accepts L3 Accepts Live Lumi, nb-1 GR SC RC
179644 x2BDBC BEAM PHYSICS_2_02 [2,424,431] 21:35:04 09:49:51 354,199,504 4,975,677 1,227,721 550.859 1
Totals 07:55:02 09:49:51 354,199,504 4,975,677 1,227,721 550.859
 - End of Shift Report
Fri Mar 5 07:56:07
Temperature plot for 1st, 2nd, 3rd floor. Apparently, temperature excursions up to 83 F have not been uncommon over the last week.
 - Steve Hahn
-- Fri Mar 5 08:03:48 comment by...Steve Hahn --  I have lowered the heating/cooling setpoints for AC-9 by 2 F. Let's see if this helps, though it also means the 3rd floor offices will be cooler.
Fri Mar 5 08:00:17 Shift Summary:
Shift started with store # 3273 (Lum 1.7E31) and run
179644 in progress; running with COT in Slightly Degraded Mode PHYSICS_2_02[2,424,4]. Initial/final
Shift Luminosity 1.7E31/1.26E31.  


- After the usual midnight heartbeat problem, we had a 25'
delay in recovering from a TOF HV trip. The ACE fixed it by restarting "without preset parameters" under
the direction of the TOF expert on call.

- Silicon:
 IMON problem with one ISL channel (I_AVDD_B5W1L1), no
 problems reported by SVXMON; two auto-HRRs on Cellid S/B1/W5/L4/C7-13.

- DQMon test looks fine (it often complains that "Ymon ouput seems to be empty"). 

- Alarms: 
  i) 2:30 COT  flow rate alarm for COT-BUBSUP.   
  The flow rate reached the low limit of 13.0 scfm and  
  remained in this condition for the whole duration of  
  the shift. Bill Noe lowered the setting to 12.0 scfm.  

  ii) 5:00 HVAC alarm. Low flow rate on EF-2, related
  to the storm;

  iii) 5:05 ACEs/COs smelled "electrical" odors in
  the control room and we went upstairs to check the L3 room.
  We found some water leaking from the ceiling on top of an electrical panel. The Cryo tech covered it
  with some plastic and we informed OPS and the Building Manager.


End of Shift Numbers
CDF Run II

Runs                   179644
Delivered Luminosity   409  
Acquired Luminosity    381  
Efficiency             93

 - S.Miscetti
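
The efficiency in the End of Shift Numbers can be cross-checked from the two luminosity figures. This is a minimal sketch assuming that efficiency is simply acquired divided by delivered luminosity, expressed in percent; the function name is hypothetical.

```python
def efficiency_percent(delivered: float, acquired: float) -> float:
    """Data-taking efficiency in percent (assumed: acquired / delivered)."""
    if delivered <= 0:
        raise ValueError("delivered luminosity must be positive")
    return 100.0 * acquired / delivered


# Values from the End of Shift Numbers above.
eff = efficiency_percent(delivered=409, acquired=381)
print(f"{eff:.1f}%")
```

Rounded to the nearest percent, this matches the 93 quoted in the summary.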
Fri Mar 5 08:02:59
L2 Decision Timeout ERROR: 

Same as last week:

L1Mon: saw 210 L1 DMA transfers, expect 1 (buffer number 0) 
L1Mon: Dumping data for  1 word. 
Word       upper 32 bits  lower 32 bits 
   0: 0x00000000	0x01000000  
   1: 0x00000000	0x40001818  
   2: 0x0083b477	0xa8948879  

.... 
  418: 0x0083b477	0xa8948879  
  419: 0x8083b477	0x88948879  
L1Mon: done. 
 - Jan :: (run 179644)
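
To compare dumps like the one above from shift to shift, the lines can be parsed into 64-bit words. This is only a sketch: the line layout is inferred from the excerpt above, and `parse_dump` and `DUMP_LINE` are hypothetical names, not part of L1Mon.

```python
import re
from typing import Dict

# Assumed layout, inferred from the excerpt: "<index>: 0x<upper 32 bits> 0x<lower 32 bits>"
DUMP_LINE = re.compile(r"^\s*(\d+):\s+0x([0-9a-fA-F]{8})\s+0x([0-9a-fA-F]{8})\s*$")


def parse_dump(text: str) -> Dict[int, int]:
    """Map each word index to its 64-bit value; lines that do not match are skipped."""
    words = {}
    for line in text.splitlines():
        match = DUMP_LINE.match(line)
        if match:
            index = int(match.group(1))
            upper = int(match.group(2), 16)
            lower = int(match.group(3), 16)
            words[index] = (upper << 32) | lower
    return words


# A few lines copied from the dump above (tab between the two columns).
sample = """\
   0: 0x00000000\t0x01000000
   1: 0x00000000\t0x40001818
   2: 0x0083b477\t0xa8948879
 419: 0x8083b477\t0x88948879
"""

words = parse_dump(sample)
for index, value in sorted(words.items()):
    print(f"{index:4d}: 0x{value:016x}")
```

Lines such as "...." or the L1Mon status messages do not match the pattern and are ignored, so a full dump can be passed in unchanged.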
Fri Mar 5 08:05:36
 - (Hr.Pl.) Vadim