Re: Not able to upload dataset using proofd

From: Gerri Ganis <gerardo.ganis_at_cern.ch>
Date: Mon, 22 Jan 2007 16:16:21 +0100

Hi Leandro,

As Antonio says, for the case of a non-local cluster based on {proofd, rootd} there is actually
a problem with uploading datasets.
I have found the origine and a fix should be ready very soon.

Cheers, Gerri

Antonio Bulgheroni wrote:

> Yes sure, this is what I'm doing and it is working since I'm using a
> single machine. But what about the case of a real cluster? Sooner or
> later I need to move in that direction. Anyhow it is working with
> xpd... so no problem.
>
> Thanks,
>
> Cheers,
>
> Antonio
>
> --
> Antonio Bulgheroni, PhD
>
> It is ok to be ignorant in some areas,
> but some people abuse the privilege.
>
> On 1/22/07, *Leandro Franco* <lfranco_at_cern.ch
> <mailto:lfranco_at_cern.ch>> wrote:
>
> Hi...
>
> If you are working only in a local session I don't think you need
> to upload the files to the cluster (they are in your machine already)
>
> See this example:
>
> http://root.cern.ch/twiki/bin/view/ROOT/XpdExampleone
>
> or try something similar with a selector:
>
>
> root [0] TProof *proof = TProof::Open("")
>
> root [1] .x ~/dev/proof/h1chain.C
>
> root [2] TDSet *dset = new TDSet(chain)
>
> root [3]
> dset->Process("/Users/lfranco/dev/root/root/tutorials/tree/h1analysis.C")
> Starting h1analysis with process option:
> Looking up for exact location of files: OK (4 files)
> Validating files: OK (4 files)
> Master-0: grand total: sent 3 objects, size: 5456 bytes
> FCN=952.072 FROM MIGRAD STATUS=CONVERGED 220 CALLS
> 221 TOTAL
> EDM=1.37809e-08 STRATEGY= 1 ERROR
> MATRIX ACCURATE
> EXT PARAMETER STEP
> FIRST
> NO. NAME VALUE ERROR SIZE
> DERIVATIVE
> 1 p0 9.59988e+05 9.07052e+04 7.92857e+01
> -2.69327e-09
> 2 p1 3.51130e-01 2.32881e-02 4.69706e-05
> 5.29428e-03
> 3 p2 1.18502e+03 5.95939e+01 6.72112e-01
> 2.29626e-06
> 4 p3 1.45569e-01 5.93851e-05 8.69320e-07
> -1.75026e+00
> 5 p4 1.24388e-03 6.63103e-05 7.86533e-07
> -6.72392e-01
> (Long64_t)0
> root [4]
>
>
>
> where ~/dev/proof/h1chain.C :
>
> {
> TChain chain("h42");
> chain.Add("~/tmp/h1/dstarmb.root"); // 21330730 bytes 21920
> events
> chain.Add("~/tmp/h1/dstarp1a.root"); // 71464503 bytes 73243
> events
> chain.Add("~/tmp/h1/dstarp1b.root"); // 83827959 bytes 85597
> events
> chain.Add("~/tmp/h1/dstarp2.root"); // 100675234 bytes 103053
> events
> }
>
>
>
> Is that what you have in mind?
>
>
> Cheers,
> Leo
>
>
> On Jan 19, 2007, at 4:20 PM, Antonio Bulgheroni wrote:
>
>> Dear Gerri,
>> thanks for your reply. That's the detailed situation:
>>
>> - the cluster is a dual core machine.
>> - proofd and rootd are running on the host using the standard ports.
>> - from the proof brand new GUI, I try to upload a set of files
>> from the DataSet tab.
>> - I leave the destination un-specified as recommended on the
>> Proof TWiki <http://root.cern.ch/twiki/bin/view/ROOT/ProofUsage>
>> and I got the message I was reporting
>>
>> If, instead of running the proofd and rootd, I run xrootd (using
>> as configuration $ROOTSYS/etc/proof/xpd.cf.sample ) it works
>> perfectly and I got the files saved into
>> /proofpoll/user/file.root and the dataset in ~user/proof/datasets/
>>
>> Regards,
>>
>>
>> Antonio
>>
>> On 1/19/07, * Gerri Ganis* <gerardo.ganis_at_cern.ch
>> <mailto:gerardo.ganis_at_cern.ch>> wrote:
>>
>>
>> Dear Antonio,
>>
>> I suspect a problem with the URL setting. Could you please
>> specify which
>> daemon should
>> receive the files on the cluster (is that rootd?) and how you
>> set the
>> destination URL?
>>
>> Of course, no matter the settings, you should not get a segv ...
>>
>> Gerri
>>
>>
>>
>> Antonio Bulgheroni wrote:
>>
>>> Dear ROOTers,
>>>
>>> I'm writing because I would like to call your
>> attention to
>>> the fact I'm not able to upload dataset on our cluster when
>> proofd
>>> (started from xinetd) is running, while everything goes
>> smooth when
>>> xrootd is running.
>>>
>>> Here comes the debug output:
>>>
>>> *** Break *** segmentation violation
>>> (no debugging symbols found)
>>> Using host libthread_db library "/lib/tls/libthread_db.so.1".
>>> Attaching to program: /proc/4571/exe, process 4571
>>> (no debugging symbols found)...done.
>>> (no debugging symbols found)...done.
>>> (no debugging symbols found)...done.
>>> [Thread debugging using libthread_db enabled]
>>> [New Thread -1208961344 (LWP 4571)]
>>> [New Thread -1219773520 (LWP 4970)]
>>> [New Thread -1209144400 (LWP 4648)]
>>> (no debugging symbols found)...done.
>>> (no debugging symbols found)...done.
>>> (no debugging symbols found)...done.
>>> (no debugging symbols found)...done.
>>> 0x0042f7a2 in _dl_sysinfo_int80
>>> () from /lib/ld-linux.so.2
>>> Thread 3 (Thread -1209144400 (LWP 4648)):
>>> #0 0x0042f7a2 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2
>>> #1 0x03487ba6 in __nanosleep_nocancel () from
>> /lib/tls/libc.so.6
>>> #2 0x034be32a in usleep () from /lib/tls/libc.so.6
>>> #3 0x06b44c6e in GarbageCollectorThread (arg=0xa15a7d8,
>>> thr=0xa159e98) at XrdClientConnMgr.cc:66
>>> #4 0x06b53256 in XrdClientThreadDispatcher (arg=0xa159ea4) at
>>> XrdClientThread.cc:29
>>> #5 0x06b606c2 in XrdOucThread_Xeq () from
>> /cern/root/lib/libXrdProofd.so
>>> #6 0x003ac371 in start_thread () from /lib/tls/libpthread.so.0
>>> #7 0x034c500e in clone () from /lib/tls/libc.so.6
>>>
>>> Thread 2 (Thread -1219773520 (LWP 4970)):
>>> #0 0x0042f7a2 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2
>>> #1 0x03487ba6 in __nanosleep_nocancel () from
>> /lib/tls/libc.so.6
>>> #2 0x034be32a in usleep () from /lib/tls/libc.so.6
>>> #3 0x06b44c6e in GarbageCollectorThread (arg=0xa2f4c78,
>>> thr=0xa320110) at XrdClientConnMgr.cc:66
>>> #4 0x06b53256 in XrdClientThreadDispatcher (arg=0xa32011c) at
>>> XrdClientThread.cc:29
>>> #5 0x06b606c2 in XrdOucThread_Xeq () from
>> /cern/root/lib/libXrdProofd.so
>>> #6 0x003ac371 in start_thread () from /lib/tls/libpthread.so.0
>>> #7 0x034c500e in clone () from /lib/tls/libc.so.6
>>>
>>> Thread 1 (Thread -1208961344 (LWP 4571)):
>>> #0 0x0042f7a2 in _dl_sysinfo_int80 () from /lib/ld- linux.so.2
>>> #1 0x0348749b in __waitpid_nocancel () from /lib/tls/libc.so.6
>>> #2 0x03431359 in do_system () from /lib/tls/libc.so.6
>>> #3 0x034316d1 in system () from /lib/tls/libc.so.6
>>> #4 0x003b278d in system () from /lib/tls/libpthread.so.0
>>> #5 0x00a9cd39 in TUnixSystem::Exec () from
>> /cern/root/lib/libCore.so
>>> #6 0x00a9d191 in TUnixSystem::StackTrace () from
>>> /cern/root/lib/libCore.so
>>> #7 0x00a9b1da in TUnixSystem::DispatchSignals () from
>>> /cern/root/lib/libCore.so
>>> #8 0x00a99201 in SigHandler () from /cern/root/lib/libCore.so
>>> #9 0x00a9f6f9 in sighandler () from /cern/root/lib/libCore.so
>>> #10 <signal handler called>
>>> #11 0x00000021 in ?? ()
>>> #12 0x01a990db in
>> TXNetSystemConnectGuard::~TXNetSystemConnectGuard ()
>>> from /cern/root/lib/libNetx.so
>>> #13 0x01a96923 in TXNetSystem::TXNetSystem () from
>>> /cern/root/lib/libNetx.so
>>> #14 0x01a9cb6c in G__G__Netx_220_0_6 () from
>> /cern/root/lib/libNetx.so
>>> #15 0x0105ea8f in Cint::G__CallFunc::Execute () from
>>> /cern/root/lib/libCint.so
>>> #16 0x00a18992 in TMethodCall::Execute () from
>> /cern/root/lib/libCore.so
>>> #17 0x00977bfe in TPluginHandler::ExecPlugin () from
>>> /cern/root/lib/libCore.so
>>> #18 0x009aa656 in TSystem::FindHelper () from
>> /cern/root/lib/libCore.so
>>> #19 0x00a9bdd1 in TUnixSystem::AccessPathName () from
>>> /cern/root/lib/libCore.so
>>> #20 0x016c2d6d in TProof::UploadDataSet () from
>> /cern/root/lib/libProof.so
>>> #21 0x0185772f in TUploadDataSetDlg::UploadDataSet () from
>>> /cern/root/lib/libProofGui.so
>>> #22 0x018567e1 in TUploadDataSetDlg::ProcessMessage () from
>>> /cern/root/lib/libProofGui.so
>>> #23 0x0463b984 in TGFrame::HandleClientMessage () from
>>> /cern/root/lib/libGui.so
>>> #24 0x0463f548 in TGMainFrame::HandleClientMessage () from
>>> /cern/root/lib/libGui.so
>>> #25 0x0463b6be in TGFrame::HandleEvent () from
>> /cern/root/lib/libGui.so
>>> #26 0x04612fe0 in TGClient::HandleEvent () from
>> /cern/root/lib/libGui.so
>>> #27 0x04612bf8 in TGClient::ProcessOneEvent () from
>>> /cern/root/lib/libGui.so
>>> #28 0x04612d17 in TGClient::HandleInput () from
>> /cern/root/lib/libGui.so
>>> #29 0x0461109e in TGInputHandler::Notify () from
>> /cern/root/lib/libGui.so
>>> #30 0x00a9a3dd in TUnixSystem::DispatchOneEvent () from
>>> /cern/root/lib/libCore.so
>>> #31 0x009a9b96 in TSystem::InnerLoop () from
>> /cern/root/lib/libCore.so
>>> #32 0x009a9b36 in TSystem::Run () from /cern/root/lib/libCore.so
>>> #33 0x009235c4 in TApplication::Run () from
>> /cern/root/lib/libCore.so
>>> #34 0x003647d8 in TRint::Run () from /cern/root/lib/libRint.so
>>> #35 0x08048d4b in main ()
>>>
>>> Thanks for your help,
>>>
>>> cheers,
>>>
>>> --
>>> Antonio Bulgheroni, PhD
>>>
>>>
>>
>>
>> --
>> +--------------------------------------------------------------------------+
>> Gerardo GANIS PH Department, CERN
>> address CERN, CH 1211 Geneve 23
>> room: 32-RC-017, tel / fax: +412276 76439
>> / 69133
>> e-mail gerardo.ganis_at_cern.ch
>> <mailto:gerardo.ganis_at_cern.ch>
>> +--------------------------------------------------------------------------+
>>
>>
>>
>
>

-- 
+--------------------------------------------------------------------------+
  Gerardo GANIS    PH Department, CERN
        address    CERN, CH 1211 Geneve 23  
                   room: 32-RC-017, tel / fax: +412276 76439 / 69133
         e-mail    gerardo.ganis_at_cern.ch
+--------------------------------------------------------------------------+
Received on Mon Jan 22 2007 - 16:18:00 CET

This archive was generated by hypermail 2.2.0 : Mon Jan 22 2007 - 23:50:00 CET