Re: Not able to upload dataset using proofd

From: Antonio Bulgheroni <antonio.bulgheroni_at_gmail.com>
Date: Mon, 22 Jan 2007 15:50:39 +0100


Yes sure, this is what I'm doing and it is working since I'm using a single machine. But what about the case of a real cluster? Sooner or later I need to move in that direction. Anyhow it is working with xpd... so no problem.

Thanks,

Cheers,

Antonio

-- 
Antonio Bulgheroni, PhD

It is ok to be ignorant in some areas,
but some people abuse the privilege.

On 1/22/07, Leandro Franco <lfranco_at_cern.ch> wrote:

>
> Hi...
>
> If you are working only in a local session I don't think you need to
> upload the files to the cluster (they are in your machine already)
>
> See this example:
>
> http://root.cern.ch/twiki/bin/view/ROOT/XpdExampleone
>
> or try something similar with a selector:
>
>
> root [0] TProof *proof = TProof::Open("")
>
> root [1] .x ~/dev/proof/h1chain.C
>
> root [2] TDSet *dset = new TDSet(chain)
>
> root [3]
> dset->Process("/Users/lfranco/dev/root/root/tutorials/tree/h1analysis.C")
> Starting h1analysis with process option:
> Looking up for exact location of files: OK (4 files)
> Validating files: OK (4 files)
> Master-0: grand total: sent 3 objects, size: 5456 bytes
> FCN=952.072 FROM MIGRAD STATUS=CONVERGED 220 CALLS 221
> TOTAL
> EDM=1.37809e-08 STRATEGY= 1 ERROR MATRIX
> ACCURATE
> EXT PARAMETER STEP FIRST
> NO. NAME VALUE ERROR SIZE DERIVATIVE
> 1 p0 9.59988e+05 9.07052e+04 7.92857e+01 -2.69327e-09
> 2 p1 3.51130e-01 2.32881e-02 4.69706e-05 5.29428e-03
> 3 p2 1.18502e+03 5.95939e+01 6.72112e-01 2.29626e-06
> 4 p3 1.45569e-01 5.93851e-05 8.69320e-07 -1.75026e+00
> 5 p4 1.24388e-03 6.63103e-05 7.86533e-07 -6.72392e-01
> (Long64_t)0
> root [4]
>
>
>
> where ~/dev/proof/h1chain.C :
>
> {
> TChain chain("h42");
> chain.Add("~/tmp/h1/dstarmb.root"); // 21330730 bytes 21920 events
> chain.Add("~/tmp/h1/dstarp1a.root"); // 71464503 bytes 73243 events
> chain.Add("~/tmp/h1/dstarp1b.root"); // 83827959 bytes 85597 events
> chain.Add("~/tmp/h1/dstarp2.root"); // 100675234 bytes 103053 events
> }
>
>
>
> Is that what you have in mind?
>
>
> Cheers,
> Leo
>
>
> On Jan 19, 2007, at 4:20 PM, Antonio Bulgheroni wrote:
>
> Dear Gerri,
> thanks for your reply. That's the detailed situation:
>
> - the cluster is a dual core machine.
> - proofd and rootd are running on the host using the standard ports.
> - from the proof brand new GUI, I try to upload a set of files from the
> DataSet tab.
> - I leave the destination un-specified as recommended on the Proof TWiki<http://root.cern.ch/twiki/bin/view/ROOT/ProofUsage>and I got the message I was reporting
>
> If, instead of running the proofd and rootd, I run xrootd (using as
> configuration $ROOTSYS/etc/proof/xpd.cf.sample ) it works perfectly and I
> got the files saved into /proofpoll/user/file.root and the dataset in
> ~user/proof/datasets/
>
> Regards,
>
>
> Antonio
>
> On 1/19/07, Gerri Ganis <gerardo.ganis_at_cern.ch> wrote:
> >
> >
> > Dear Antonio,
> >
> > I suspect a problem with the URL setting. Could you please specify which
> > daemon should
> > receive the files on the cluster (is that rootd?) and how you set the
> > destination URL?
> >
> > Of course, no matter the settings, you should not get a segv ...
> >
> > Gerri
> >
> >
> >
> > Antonio Bulgheroni wrote:
> >
> > > Dear ROOTers,
> > >
> > > I'm writing because I would like to call your attention to
> > > the fact I'm not able to upload dataset on our cluster when proofd
> > > (started from xinetd) is running, while everything goes smooth when
> > > xrootd is running.
> > >
> > > Here comes the debug output:
> > >
> > > *** Break *** segmentation violation
> > > (no debugging symbols found)
> > > Using host libthread_db library "/lib/tls/libthread_db.so.1".
> > > Attaching to program: /proc/4571/exe, process 4571
> > > (no debugging symbols found)...done.
> > > (no debugging symbols found)...done.
> > > (no debugging symbols found)...done.
> > > [Thread debugging using libthread_db enabled]
> > > [New Thread -1208961344 (LWP 4571)]
> > > [New Thread -1219773520 (LWP 4970)]
> > > [New Thread -1209144400 (LWP 4648)]
> > > (no debugging symbols found)...done.
> > > (no debugging symbols found)...done.
> > > (no debugging symbols found)...done.
> > > (no debugging symbols found)...done.
> > > 0x0042f7a2 in _dl_sysinfo_int80
> > > () from /lib/ld-linux.so.2
> > > Thread 3 (Thread -1209144400 (LWP 4648)):
> > > #0 0x0042f7a2 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2
> > > #1 0x03487ba6 in __nanosleep_nocancel () from /lib/tls/libc.so.6
> > > #2 0x034be32a in usleep () from /lib/tls/libc.so.6
> > > #3 0x06b44c6e in GarbageCollectorThread (arg=0xa15a7d8,
> > > thr=0xa159e98) at XrdClientConnMgr.cc:66
> > > #4 0x06b53256 in XrdClientThreadDispatcher (arg=0xa159ea4) at
> > > XrdClientThread.cc:29
> > > #5 0x06b606c2 in XrdOucThread_Xeq () from
> > /cern/root/lib/libXrdProofd.so
> > > #6 0x003ac371 in start_thread () from /lib/tls/libpthread.so.0
> > > #7 0x034c500e in clone () from /lib/tls/libc.so.6
> > >
> > > Thread 2 (Thread -1219773520 (LWP 4970)):
> > > #0 0x0042f7a2 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2
> > > #1 0x03487ba6 in __nanosleep_nocancel () from /lib/tls/libc.so.6
> > > #2 0x034be32a in usleep () from /lib/tls/libc.so.6
> > > #3 0x06b44c6e in GarbageCollectorThread (arg=0xa2f4c78,
> > > thr=0xa320110) at XrdClientConnMgr.cc:66
> > > #4 0x06b53256 in XrdClientThreadDispatcher (arg=0xa32011c) at
> > > XrdClientThread.cc:29
> > > #5 0x06b606c2 in XrdOucThread_Xeq () from
> > /cern/root/lib/libXrdProofd.so
> > > #6 0x003ac371 in start_thread () from /lib/tls/libpthread.so.0
> > > #7 0x034c500e in clone () from /lib/tls/libc.so.6
> > >
> > > Thread 1 (Thread -1208961344 (LWP 4571)):
> > > #0 0x0042f7a2 in _dl_sysinfo_int80 () from /lib/ld- linux.so.2
> > > #1 0x0348749b in __waitpid_nocancel () from /lib/tls/libc.so.6
> > > #2 0x03431359 in do_system () from /lib/tls/libc.so.6
> > > #3 0x034316d1 in system () from /lib/tls/libc.so.6
> > > #4 0x003b278d in system () from /lib/tls/libpthread.so.0
> > > #5 0x00a9cd39 in TUnixSystem::Exec () from /cern/root/lib/libCore.so
> > > #6 0x00a9d191 in TUnixSystem::StackTrace () from
> > > /cern/root/lib/libCore.so
> > > #7 0x00a9b1da in TUnixSystem::DispatchSignals () from
> > > /cern/root/lib/libCore.so
> > > #8 0x00a99201 in SigHandler () from /cern/root/lib/libCore.so
> > > #9 0x00a9f6f9 in sighandler () from /cern/root/lib/libCore.so
> > > #10 <signal handler called>
> > > #11 0x00000021 in ?? ()
> > > #12 0x01a990db in TXNetSystemConnectGuard::~TXNetSystemConnectGuard ()
> > > from /cern/root/lib/libNetx.so
> > > #13 0x01a96923 in TXNetSystem::TXNetSystem () from
> > > /cern/root/lib/libNetx.so
> > > #14 0x01a9cb6c in G__G__Netx_220_0_6 () from /cern/root/lib/libNetx.so
> > > #15 0x0105ea8f in Cint::G__CallFunc::Execute () from
> > > /cern/root/lib/libCint.so
> > > #16 0x00a18992 in TMethodCall::Execute () from
> > /cern/root/lib/libCore.so
> > > #17 0x00977bfe in TPluginHandler::ExecPlugin () from
> > > /cern/root/lib/libCore.so
> > > #18 0x009aa656 in TSystem::FindHelper () from
> > /cern/root/lib/libCore.so
> > > #19 0x00a9bdd1 in TUnixSystem::AccessPathName () from
> > > /cern/root/lib/libCore.so
> > > #20 0x016c2d6d in TProof::UploadDataSet () from
> > /cern/root/lib/libProof.so
> > > #21 0x0185772f in TUploadDataSetDlg::UploadDataSet () from
> > > /cern/root/lib/libProofGui.so
> > > #22 0x018567e1 in TUploadDataSetDlg::ProcessMessage () from
> > > /cern/root/lib/libProofGui.so
> > > #23 0x0463b984 in TGFrame::HandleClientMessage () from
> > > /cern/root/lib/libGui.so
> > > #24 0x0463f548 in TGMainFrame::HandleClientMessage () from
> > > /cern/root/lib/libGui.so
> > > #25 0x0463b6be in TGFrame::HandleEvent () from
> > /cern/root/lib/libGui.so
> > > #26 0x04612fe0 in TGClient::HandleEvent () from
> > /cern/root/lib/libGui.so
> > > #27 0x04612bf8 in TGClient::ProcessOneEvent () from
> > > /cern/root/lib/libGui.so
> > > #28 0x04612d17 in TGClient::HandleInput () from
> > /cern/root/lib/libGui.so
> > > #29 0x0461109e in TGInputHandler::Notify () from
> > /cern/root/lib/libGui.so
> > > #30 0x00a9a3dd in TUnixSystem::DispatchOneEvent () from
> > > /cern/root/lib/libCore.so
> > > #31 0x009a9b96 in TSystem::InnerLoop () from /cern/root/lib/libCore.so
> > > #32 0x009a9b36 in TSystem::Run () from /cern/root/lib/libCore.so
> > > #33 0x009235c4 in TApplication::Run () from /cern/root/lib/libCore.so
> > > #34 0x003647d8 in TRint::Run () from /cern/root/lib/libRint.so
> > > #35 0x08048d4b in main ()
> > >
> > > Thanks for your help,
> > >
> > > cheers,
> > >
> > > --
> > > Antonio Bulgheroni, PhD
> > >
> > >
> >
> >
> > --
> >
> > +--------------------------------------------------------------------------+
> > Gerardo GANIS PH Department, CERN
> > address CERN, CH 1211 Geneve 23
> > room: 32-RC-017, tel / fax: +412276 76439 / 69133
> > e-mail gerardo.ganis_at_cern.ch
> >
> > +--------------------------------------------------------------------------+
> >
> >
>
>
Received on Mon Jan 22 2007 - 15:51:14 CET

This archive was generated by hypermail 2.2.0 : Mon Jan 22 2007 - 17:50:00 CET