The Gaudi Framework  v29r2 (7a580596)
AvalancheSchedulerSvc Class Reference


#include <src/AvalancheSchedulerSvc.h>


Classes

struct  enqueueSchedulerActionTask
 
class  SchedulerState
 

Public Member Functions

 ~AvalancheSchedulerSvc () override=default
 Destructor.
 
StatusCode initialize () override
 Initialise.
 
StatusCode finalize () override
 Finalise.
 
StatusCode pushNewEvent (EventContext *eventContext) override
 Make an event available to the scheduler.
 
StatusCode pushNewEvents (std::vector< EventContext * > &eventContexts) override
 
StatusCode popFinishedEvent (EventContext *&eventContext) override
 Blocks until an event is available.
 
StatusCode tryPopFinishedEvent (EventContext *&eventContext) override
 Try to fetch an event from the scheduler.
 
unsigned int freeSlots () override
 Get the number of free slots.
 
void addAlg (Algorithm *, EventContext *, pthread_t)
 
bool delAlg (Algorithm *)
 
void dumpState () override
 
- Public Member Functions inherited from extends< Service, IScheduler >
void * i_cast (const InterfaceID &tid) const override
 Implementation of IInterface::i_cast. More...
 
StatusCode queryInterface (const InterfaceID &ti, void **pp) override
 Implementation of IInterface::queryInterface. More...
 
std::vector< std::string > getInterfaceNames () const override
 Implementation of IInterface::getInterfaceNames. More...
 
 ~extends () override=default
 Virtual destructor. More...
 
- Public Member Functions inherited from Service
const std::string & name () const override
 Retrieve name of the service. More...
 
StatusCode configure () override
 
StatusCode initialize () override
 
StatusCode start () override
 
StatusCode stop () override
 
StatusCode finalize () override
 
StatusCode terminate () override
 
Gaudi::StateMachine::State FSMState () const override
 
Gaudi::StateMachine::State targetFSMState () const override
 
StatusCode reinitialize () override
 
StatusCode restart () override
 
StatusCode sysInitialize () override
 Initialize Service. More...
 
StatusCode sysStart () override
 Start Service.
 
StatusCode sysStop () override
 Stop Service.
 
StatusCode sysFinalize () override
 Finalize Service. More...
 
StatusCode sysReinitialize () override
 Re-initialize the Service. More...
 
StatusCode sysRestart () override
 Restart the Service.
 
 Service (std::string name, ISvcLocator *svcloc)
 Standard Constructor. More...
 
SmartIF< ISvcLocator > & serviceLocator () const override
 Retrieve pointer to service locator. More...
 
StatusCode setProperties ()
 Method for setting declared properties to the values specified for the job. More...
 
template<class T >
StatusCode service (const std::string &name, const T *&psvc, bool createIf=true) const
 Access a service by name, creating it if it doesn't already exist. More...
 
template<class T >
StatusCode service (const std::string &name, T *&psvc, bool createIf=true) const
 
template<typename IFace = IService>
SmartIF< IFace > service (const std::string &name, bool createIf=true) const
 
template<class T >
StatusCode service (const std::string &svcType, const std::string &svcName, T *&psvc) const
 Access a service by name and type, creating it if it doesn't already exist. More...
 
template<class T >
StatusCode declareTool (ToolHandle< T > &handle, std::string toolTypeAndName, bool createIf=true)
 Declare used tool. More...
 
SmartIF< IAuditorSvc > & auditorSvc () const
 The standard auditor service. May not be invoked before sysInitialize() has been invoked.
 
- Public Member Functions inherited from PropertyHolder< CommonMessaging< implements< IService, IProperty, IStateful > > >
 PropertyHolder ()=default
 
 ~PropertyHolder () override=default
 
Gaudi::Details::PropertyBase & declareProperty (Gaudi::Details::PropertyBase &prop)
 Declare a property.
 
Gaudi::Details::PropertyBase * declareProperty (const std::string &name, TYPE &value, const std::string &doc="none")
 Helper to wrap a regular data member and use it as a regular property.
 
Gaudi::Details::PropertyBase * declareProperty (const std::string &name, TYPE &value, const std::string &doc="none") const
 
Gaudi::Details::PropertyBase * declareProperty (const std::string &name, Gaudi::Property< TYPE, VERIFIER, HANDLERS > &prop, const std::string &doc="none")
 Declare a PropertyBase instance setting name and documentation.
 
Gaudi::Details::PropertyBase * declareRemoteProperty (const std::string &name, IProperty *rsvc, const std::string &rname="")
 Declare a remote property.
 
StatusCode setProperty (const Gaudi::Details::PropertyBase &p) override
 set the property from another property
 
StatusCode setProperty (const std::string &s) override
 set the property from the formatted string More...
 
StatusCode setProperty (const std::string &n, const std::string &v) override
 set the property from name and the value More...
 
StatusCode setProperty (const std::string &name, const TYPE &value)
 set the property from the value
 
StatusCode getProperty (Gaudi::Details::PropertyBase *p) const override
 get the property More...
 
const Gaudi::Details::PropertyBase & getProperty (const std::string &name) const override
 get the property by name More...
 
StatusCode getProperty (const std::string &n, std::string &v) const override
 convert the property to the string More...
 
const std::vector< Gaudi::Details::PropertyBase * > & getProperties () const override
 get all properties More...
 
bool hasProperty (const std::string &name) const override
 Return true if we have a property with the given name. More...
 
 PropertyHolder (const PropertyHolder &)=delete
 
PropertyHolder & operator= (const PropertyHolder &)=delete
 
Gaudi::Details::PropertyBase * declareProperty (const std::string &name, GaudiHandleBase &ref, const std::string &doc="none")
 Specializations for various GaudiHandles.
 
Gaudi::Details::PropertyBase * declareProperty (const std::string &name, GaudiHandleArrayBase &ref, const std::string &doc="none")
 
Gaudi::Details::PropertyBase * declareProperty (const std::string &name, DataObjectHandleBase &ref, const std::string &doc="none")
 
- Public Member Functions inherited from CommonMessagingBase
virtual ~CommonMessagingBase ()=default
 Virtual destructor. More...
 
const SmartIF< IMessageSvc > & msgSvc () const
 The standard message service. More...
 
MsgStream & msgStream () const
 Return an uninitialized MsgStream.
 
MsgStream & msgStream (const MSG::Level level) const
 Predefined configurable message stream for the efficient printouts.
 
MsgStream & always () const
 shortcut for the method msgStream(MSG::ALWAYS)
 
MsgStream & fatal () const
 shortcut for the method msgStream(MSG::FATAL)
 
MsgStream & err () const
 shortcut for the method msgStream(MSG::ERROR)
 
MsgStream & error () const
 shortcut for the method msgStream(MSG::ERROR)
 
MsgStream & warning () const
 shortcut for the method msgStream(MSG::WARNING)
 
MsgStream & info () const
 shortcut for the method msgStream(MSG::INFO)
 
MsgStream & debug () const
 shortcut for the method msgStream(MSG::DEBUG)
 
MsgStream & verbose () const
 shortcut for the method msgStream(MSG::VERBOSE)
 
MsgStream & msg () const
 shortcut for the method msgStream(MSG::INFO)
 
MSG::Level msgLevel () const
 get the cached level (originally extracted from the embedded MsgStream) More...
 
MSG::Level outputLevel () const __attribute__((deprecated))
 Backward compatibility function for getting the output level. More...
 
bool msgLevel (MSG::Level lvl) const
 get the output level from the embedded MsgStream More...
 
- Public Member Functions inherited from extend_interfaces< Interfaces... >
 ~extend_interfaces () override=default
 Virtual destructor. More...
 

Private Types

enum  ActivationState { INACTIVE = 0, ACTIVE = 1, FAILURE = 2 }
 

Private Member Functions

void activate ()
 Activate scheduler.
 
StatusCode deactivate ()
 Deactivate scheduler.
 
unsigned int algname2index (const std::string &algoname)
 Convert a name to an integer.
 
const std::string & index2algname (unsigned int index)
 Convert an integer to a name.
 
StatusCode eventFailed (EventContext *eventContext)
 Method to check if an event failed and take appropriate actions.
 
StatusCode updateStates (int si=-1, const std::string &algo_name=std::string())
 Loop over the algorithms in the slots and promote them to successive states (-1 means all slots; an empty string means the update of the Control Flow state is skipped).
 
StatusCode promoteToScheduled (unsigned int iAlgo, int si)
 Algorithm promotion.
 
StatusCode promoteToAsyncScheduled (unsigned int iAlgo, int si)
 
StatusCode promoteToExecuted (unsigned int iAlgo, int si, IAlgorithm *algo, EventContext *)
 The call to this method is triggered only from within the AlgoExecutionTask.
 
StatusCode promoteToAsyncExecuted (unsigned int iAlgo, int si, IAlgorithm *algo, EventContext *)
 The call to this method is triggered only from within the IOBoundAlgTask.
 
StatusCode promoteToFinished (unsigned int iAlgo, int si)
 
StatusCode isStalled (int si)
 Check if the scheduling is in a stall.
 
void dumpSchedulerState (int iSlot)
 Dump the state of the scheduler.
 
StatusCode m_drain ()
 Drain the actions present in the queue.
 
void dumpState (std::ostringstream &)
 

Private Attributes

Gaudi::Property< int > m_threadPoolSize
 
Gaudi::Property< std::string > m_whiteboardSvcName {this, "WhiteboardSvc", "EventDataSvc", "The whiteboard name"}
 
Gaudi::Property< std::string > m_IOBoundAlgSchedulerSvcName {this, "IOBoundAlgSchedulerSvc", "IOBoundAlgSchedulerSvc"}
 
Gaudi::Property< unsigned int > m_maxIOBoundAlgosInFlight
 
Gaudi::Property< bool > m_simulateExecution
 
Gaudi::Property< std::string > m_optimizationMode
 
Gaudi::Property< bool > m_dumpIntraEventDynamics
 
Gaudi::Property< bool > m_useIOBoundAlgScheduler
 
Gaudi::Property< bool > m_checkDeps {this, "CheckDependencies", false, "Runtime check of Algorithm Data Dependencies"}
 
Gaudi::Property< std::string > m_useDataLoader
 
Gaudi::Property< bool > m_enableCondSvc {this, "EnableConditions", false, "Enable ConditionsSvc"}
 
Gaudi::Property< bool > m_showDataDeps
 
Gaudi::Property< bool > m_showDataFlow
 
Gaudi::Property< bool > m_showControlFlow
 
std::atomic< ActivationState > m_isActive {INACTIVE}
 Flag to track if the scheduler is active or not.
 
std::thread m_thread
 The thread in which the activate function runs.
 
std::unordered_map< std::string, unsigned int > m_algname_index_map
 Map to bookkeep the information necessary to the name2index conversion.
 
std::vector< std::string > m_algname_vect
 Vector to bookkeep the information necessary to the index2name conversion.
 
SmartIF< IPrecedenceSvc > m_precSvc
 A shortcut to the Precedence Service.
 
SmartIF< IHiveWhiteBoard > m_whiteboard
 A shortcut to the whiteboard.
 
SmartIF< IAccelerator > m_IOBoundAlgScheduler
 A shortcut to the IO-bound algorithm scheduler.
 
std::vector< EventSlot > m_eventSlots
 Vector of event slots.
 
std::atomic_int m_freeSlots
 Atomic to account for asynchronous updates by the scheduler with respect to the rest.
 
tbb::concurrent_bounded_queue< EventContext * > m_finishedEvents
 Queue of finished events.
 
SmartIF< IAlgExecStateSvc > m_algExecStateSvc
 Algorithm execution state manager.
 
SmartIF< ICondSvc > m_condSvc
 A shortcut to the service for Conditions handling.
 
unsigned int m_algosInFlight = 0
 Number of algorithms presently in flight.
 
unsigned int m_IOBoundAlgosInFlight = 0
 Number of IO-bound algorithms presently in flight.
 
bool m_updateNeeded = true
 Keep track of update actions scheduled.
 
SmartIF< IAlgResourcePool > m_algResourcePool
 Cache for the algorithm resource pool.
 
tbb::concurrent_bounded_queue< action > m_actionsQueue
 Queue where closures are stored and picked for execution.
 
SmartIF< IThreadPoolSvc > m_threadPoolSvc
 
size_t m_maxEventsInFlight {0}
 
size_t m_maxAlgosInFlight {1}
 
bool m_first = true
 

Static Private Attributes

static std::list< SchedulerState > m_sState
 
static std::mutex m_ssMut
 

Additional Inherited Members

- Public Types inherited from extends< Service, IScheduler >
using base_class = extends
 Typedef to this class. More...
 
using extend_interfaces_base = extend_interfaces< Interfaces... >
 Typedef to the base of this class. More...
 
- Public Types inherited from Service
typedef Gaudi::PluginService::Factory< IService *, const std::string &, ISvcLocator * > Factory
 
- Public Types inherited from PropertyHolder< CommonMessaging< implements< IService, IProperty, IStateful > > >
using PropertyHolderImpl = PropertyHolder< CommonMessaging< implements< IService, IProperty, IStateful > > >
 Typedef used to refer to this class from derived classes, as in. More...
 
- Public Types inherited from CommonMessaging< implements< IService, IProperty, IStateful > >
using base_class = CommonMessaging
 
- Public Types inherited from extend_interfaces< Interfaces... >
using ext_iids = typename Gaudi::interface_list_cat< typename Interfaces::ext_iids... >::type
 take union of the ext_iids of all Interfaces... More...
 
- Protected Member Functions inherited from Service
 ~Service () override
 Standard Destructor. More...
 
int outputLevel () const
 get the Service's output level More...
 
- Protected Member Functions inherited from PropertyHolder< CommonMessaging< implements< IService, IProperty, IStateful > > >
Gaudi::Details::PropertyBase * property (const std::string &name) const
 
- Protected Member Functions inherited from CommonMessaging< implements< IService, IProperty, IStateful > >
MSG::Level setUpMessaging ()
 Set up local caches. More...
 
MSG::Level resetMessaging ()
 Reinitialize internal states. More...
 
void updateMsgStreamOutputLevel (int level)
 Update the output level of the cached MsgStream. More...
 
- Protected Attributes inherited from Service
Gaudi::StateMachine::State m_state = Gaudi::StateMachine::OFFLINE
 Service state. More...
 
Gaudi::StateMachine::State m_targetState = Gaudi::StateMachine::OFFLINE
 Service state. More...
 
Gaudi::Property< int > m_outputLevel {this, "OutputLevel", MSG::NIL, "output level"}
 
Gaudi::Property< bool > m_auditInit {this, "AuditServices", false, "[[deprecated]] unused"}
 
Gaudi::Property< bool > m_auditorInitialize {this, "AuditInitialize", false, "trigger auditor on initialize()"}
 
Gaudi::Property< bool > m_auditorStart {this, "AuditStart", false, "trigger auditor on start()"}
 
Gaudi::Property< bool > m_auditorStop {this, "AuditStop", false, "trigger auditor on stop()"}
 
Gaudi::Property< bool > m_auditorFinalize {this, "AuditFinalize", false, "trigger auditor on finalize()"}
 
Gaudi::Property< bool > m_auditorReinitialize {this, "AuditReinitialize", false, "trigger auditor on reinitialize()"}
 
Gaudi::Property< bool > m_auditorRestart {this, "AuditRestart", false, "trigger auditor on restart()"}
 
SmartIF< IAuditorSvc > m_pAuditorSvc
 Auditor Service. More...
 

Detailed Description

Introduction

The scheduler is named after its ability to generically maximize the average intra-event task occupancy by inducing avalanche-like concurrency disclosure waves in conditions of arbitrary intra-event task precedence constraints (see section 3.2 of http://cern.ch/go/7Jn7).

Task precedence management

The scheduler is driven by graph-based task precedence management. Compared with the approach used in the ForwardSchedulerSvc, it offers the following advantages:

(1) Faster decision making (and thus lower concurrency disclosure downtime); (2) Capacity for proactive task scheduling decisions.

Point (2) made it possible to implement a number of generic, non-intrusive scheduling strategies that maximize intra-event throughput.

Scheduling principles

o Task scheduling prerequisites

A task is scheduled as soon as all of the following conditions are met:

  • a control flow (CF) graph traversal reaches the task;
  • all data flow (DF) dependencies of the task are satisfied;
  • the DF-ready task pool parsing mechanism (*) considers it, and:
    • a free (or re-entrant) algorithm instance to run within the task is available;
    • there is a free computational resource to run the task.
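
The prerequisites above can be read as a single predicate over the state of a task. The following is a minimal sketch only: TaskSnapshot, canSchedule and freeWorkers are hypothetical names introduced for illustration and are not part of the Gaudi API; the real scheduler derives this information from the precedence service, the whiteboard and the algorithm resource pool.

```cpp
#include <cstddef>

// Hypothetical snapshot of what a scheduler knows about one task (illustration only).
struct TaskSnapshot {
  bool        reachedByControlFlow; // a CF graph traversal has reached the task
  std::size_t missingDataInputs;    // number of DF dependencies not yet satisfied
  bool        algInstanceAvailable; // a free (or re-entrant) algorithm instance exists
};

// True only when every prerequisite holds; freeWorkers stands in for
// "a free computational resource to run the task".
inline bool canSchedule( const TaskSnapshot& task, std::size_t freeWorkers )
{
  return task.reachedByControlFlow && task.missingDataInputs == 0 && task.algInstanceAvailable &&
         freeWorkers > 0;
}
```

A task with one missing data input, for example, is held back even when a worker and an algorithm instance are both free.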

o (*) Avalanche induction strategies

The scheduler is able to maximize the intra-event throughput by applying several search strategies within the pool, prioritizing tasks according to the following types of precedence rules graph asymmetries:

(A) Local task-to-data asymmetry; (B) Local task-to-task asymmetry; (C) Global task-to-task asymmetry.
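
As a rough illustration of prioritizing tasks within the DF-ready pool, the sketch below keeps ready tasks in a priority queue and hands out the highest-ranked ones first. The names ReadyTask, ByRank, ReadyPool and pickNext are hypothetical, and the integer rank is only a stand-in for whichever asymmetry measure a strategy uses; how the scheduler actually computes its ranks is not shown on this page.

```cpp
#include <queue>
#include <string>
#include <vector>

// Hypothetical DF-ready task with a precomputed priority (illustration only).
struct ReadyTask {
  std::string name;
  unsigned    rank; // stand-in for an asymmetry measure; higher runs first
};

// Orders the priority queue so that the task with the highest rank is on top.
struct ByRank {
  bool operator()( const ReadyTask& a, const ReadyTask& b ) const { return a.rank < b.rank; }
};

using ReadyPool = std::priority_queue<ReadyTask, std::vector<ReadyTask>, ByRank>;

// Removes up to freeWorkers tasks from the pool, highest rank first,
// and returns their names in the order they would be launched.
inline std::vector<std::string> pickNext( ReadyPool& pool, unsigned freeWorkers )
{
  std::vector<std::string> picked;
  while ( freeWorkers > 0 && !pool.empty() ) {
    picked.push_back( pool.top().name );
    pool.pop();
    --freeWorkers;
  }
  return picked;
}
```

With two free workers and three ready tasks, the two highest-ranked tasks are launched and the third stays in the pool until a resource is released.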

o Other mechanisms of throughput maximization

The scheduler is able to maximize the overall throughput of data processing by scheduling the CPU-blocking tasks efficiently. The mechanism can be applied to the following types of tasks:

  • I/O-bound tasks;
  • tasks with computation offloading (accelerators, GPGPUs, clouds, quantum computing devices..joke);
  • synchronization-bound tasks.

Credits

Historically, the AvalancheSchedulerSvc branched off the ForwardSchedulerSvc and in many ways built its success on ideas and code of the latter.

Author
Illya Shapoval
Version
1.0

Definition at line 102 of file AvalancheSchedulerSvc.h.

Member Enumeration Documentation

Constructor & Destructor Documentation

AvalancheSchedulerSvc::~AvalancheSchedulerSvc ( )
override default

Destructor.

Member Function Documentation

void AvalancheSchedulerSvc::activate ( )
private

Activate scheduler.

Activate the scheduler.

From this moment on the queue of actions is checked. Checking stops once the m_isActive flag is no longer ACTIVE and the queue is empty, which guarantees that all actions are executed and no stall is created. The TBB pool must be initialised in the thread from which the tasks are launched (http://threadingbuildingblocks.org/docs/doxygen/a00342.html). The scheduler is therefore initialised here, since this method runs in a separate thread and spawns the tasks (through the execution of the lambdas).

Definition at line 363 of file AvalancheSchedulerSvc.cpp.

364 {
365 
366  if ( msgLevel( MSG::DEBUG ) ) debug() << "AvalancheSchedulerSvc::activate()" << endmsg;
367 
368  if ( m_threadPoolSvc->initPool( m_threadPoolSize ).isFailure() ) {
369  error() << "problems initializing ThreadPoolSvc" << endmsg;
370  m_isActive = FAILURE;
371  return;
372  }
373 
374  // Wait for actions pushed into the queue by finishing tasks.
375  action thisAction;
376  StatusCode sc( StatusCode::SUCCESS );
377 
378  m_isActive = ACTIVE;
379 
380  // Continue to wait if the scheduler is running or there is something to do
381  info() << "Start checking the actionsQueue" << endmsg;
382  while ( m_isActive == ACTIVE or m_actionsQueue.size() != 0 ) {
383  m_actionsQueue.pop( thisAction );
384  sc = thisAction();
385  if ( sc != StatusCode::SUCCESS )
386  verbose() << "Action did not succeed (which is not bad per se)." << endmsg;
387  else
388  verbose() << "Action succeeded." << endmsg;
389  }
390 
391  info() << "Terminating thread-pool resources" << endmsg;
392  if ( m_threadPoolSvc->terminatePool().isFailure() ) {
393  error() << "Problems terminating thread pool" << endmsg;
394  m_isActive = FAILURE;
395  }
396 }

void AvalancheSchedulerSvc::addAlg ( Algorithm * a, EventContext * e, pthread_t t )

Definition at line 1107 of file AvalancheSchedulerSvc.cpp.

1108 {
1109 
1110  std::lock_guard<std::mutex> lock( m_ssMut );
1111  m_sState.push_back( SchedulerState( a, e, t ) );
1112 }

unsigned int AvalancheSchedulerSvc::algname2index ( const std::string & algoname )
inline private

Convert a name to an integer.

Definition at line 431 of file AvalancheSchedulerSvc.cpp.

432 {
433  unsigned int index = m_algname_index_map[algoname];
434  return index;
435 }
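
Note that std::unordered_map::operator[], as used in the listing above, default-inserts an entry with value 0 when the name is unknown, so an unregistered name is indistinguishable from the algorithm at index 0. The sketch below shows one way to keep the two conversions consistent and to make unknown names fail loudly. AlgIndex and its member names are hypothetical and do not correspond to Gaudi classes; it is an illustration under those assumptions, not a proposed patch.

```cpp
#include <stdexcept>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical two-way bookkeeping between algorithm names and indices (illustration only).
class AlgIndex
{
public:
  // Registers a name and returns its index; registering a known name returns the existing index.
  unsigned int add( const std::string& name )
  {
    auto found = m_nameToIndex.find( name );
    if ( found != m_nameToIndex.end() ) return found->second;
    const unsigned int index = static_cast<unsigned int>( m_names.size() );
    m_names.push_back( name );
    m_nameToIndex.emplace( name, index );
    return index;
  }

  // Name to index; throws std::out_of_range for unknown names instead of inserting 0.
  unsigned int indexOf( const std::string& name ) const { return m_nameToIndex.at( name ); }

  // Index to name; throws std::out_of_range for indices that were never assigned.
  const std::string& nameOf( unsigned int index ) const { return m_names.at( index ); }

private:
  std::unordered_map<std::string, unsigned int> m_nameToIndex;
  std::vector<std::string>                      m_names;
};
```

Using at() trades a small bounds check for an early, explicit error when a name or index was never registered.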

StatusCode AvalancheSchedulerSvc::deactivate ( )
private

Deactivate scheduler.

Deactivates the scheduler.

Two actions are pushed into the queue: 1) drain the scheduler until all events are finished; 2) flip the status flag m_isActive to INACTIVE. This second action is the last one to be executed by the scheduler.

Definition at line 406 of file AvalancheSchedulerSvc.cpp.

407 {
408 
409  if ( m_isActive == ACTIVE ) {
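410  // Drain the scheduler
411  m_actionsQueue.push( std::bind( &AvalancheSchedulerSvc::m_drain, this ) );
412  // This would be the last action
413  m_actionsQueue.push( [this]() -> StatusCode {
414  m_isActive = INACTIVE;
415  return StatusCode::SUCCESS;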
416  } );
417  }
418 
419  return StatusCode::SUCCESS;
420 }
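
The interplay between activate() and deactivate() is a common pattern: a single consumer thread executes queued closures, and shutdown is requested by queueing one last closure that flips the activity flag, so everything queued before it still runs. The sketch below reproduces that pattern with standard-library primitives only. ActionLoop and its members are hypothetical names, std::queue with a condition variable stands in for tbb::concurrent_bounded_queue, and the drain step of the real scheduler is omitted.

```cpp
#include <atomic>
#include <condition_variable>
#include <functional>
#include <mutex>
#include <queue>
#include <thread>
#include <utility>

// Hypothetical stand-in for the scheduler's action loop (illustration only).
class ActionLoop
{
public:
  using Action = std::function<bool()>;

  // Starts the consumer thread (the role played by activate()).
  void start()
  {
    m_thread = std::thread( [this] { run(); } );
  }

  // Enqueues an action; may be called from any thread.
  void push( Action action )
  {
    {
      std::lock_guard<std::mutex> lock( m_mutex );
      m_queue.push( std::move( action ) );
    }
    m_cv.notify_one();
  }

  // Queues the flag flip as the last action, then joins the thread
  // (the roles played by deactivate() and finalize()). Must be called once after start().
  void stop()
  {
    push( [this] {
      m_active = false;
      return true;
    } );
    m_thread.join();
  }

  int executed() const { return m_executed.load(); }

private:
  void run()
  {
    for ( ;; ) {
      Action next;
      {
        std::unique_lock<std::mutex> lock( m_mutex );
        // Leave only when deactivated AND nothing is left to execute.
        if ( !m_active && m_queue.empty() ) return;
        m_cv.wait( lock, [this] { return !m_queue.empty(); } );
        next = std::move( m_queue.front() );
        m_queue.pop();
      }
      next(); // executed outside the lock so that actions may push follow-up actions
      ++m_executed;
    }
  }

  std::mutex              m_mutex;
  std::condition_variable m_cv;
  std::queue<Action>      m_queue;
  std::atomic<bool>       m_active{true};
  std::atomic<int>        m_executed{0};
  std::thread             m_thread;
};
```

Because the flag is only changed by an action running on the consumer thread itself, every action queued before stop() is guaranteed to execute before the loop exits.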

bool AvalancheSchedulerSvc::delAlg ( Algorithm * a )

Definition at line 1115 of file AvalancheSchedulerSvc.cpp.

1116 {
1117 
1118  std::lock_guard<std::mutex> lock( m_ssMut );
1119 
1120  for ( std::list<SchedulerState>::iterator itr = m_sState.begin(); itr != m_sState.end(); ++itr ) {
1121  if ( *itr == a ) {
1122  m_sState.erase( itr );
1123  return true;
1124  }
1125  }
1126 
1127  error() << "could not find Alg " << a->name() << " in Scheduler!" << endmsg;
1128  return false;
1129 }

void AvalancheSchedulerSvc::dumpSchedulerState ( int iSlot )
private

Dump the state of the scheduler.

Used for debugging purposes, the state of the scheduler is dumped on screen in order to be inspected.

The dependencies of each algo are printed and the missing ones specified.

Definition at line 823 of file AvalancheSchedulerSvc.cpp.

824 {
825 
826  // To have just one big message
827  std::ostringstream outputMessageStream;
828 
829  outputMessageStream << "============================== Execution Task State ============================="
830  << std::endl;
831  dumpState( outputMessageStream );
832 
833  outputMessageStream << std::endl
834  << "============================== Scheduler State ================================="
835  << std::endl;
836 
837  int slotCount = -1;
838  for ( auto& thisSlot : m_eventSlots ) {
839  slotCount++;
840  if ( thisSlot.complete ) continue;
841 
842  // dump temporal and topological precedence analysis (if enabled in the PrecedenceSvc)
843  if ( msgLevel( MSG::DEBUG ) ) m_precSvc->dumpPrecedenceRules( thisSlot );
844 
845  outputMessageStream << "----------- slot: " << thisSlot.eventContext->slot()
846  << " event: " << thisSlot.eventContext->evt() << " -----------" << std::endl;
847 
848  if ( 0 > iSlot or iSlot == slotCount ) {
849  outputMessageStream << "Algorithms states:" << std::endl;
850 
851  const DataObjIDColl& wbSlotContent( thisSlot.dataFlowMgr.content() );
852  for ( unsigned int algoIdx = 0; algoIdx < thisSlot.algsStates.size(); ++algoIdx ) {
853  outputMessageStream << " o " << index2algname( algoIdx ) << " ["
854  << AlgsExecutionStates::stateNames[thisSlot.algsStates[algoIdx]] << "] Data deps: ";
855  DataObjIDColl deps( thisSlot.dataFlowMgr.dataDependencies( algoIdx ) );
856  const int depsSize = deps.size();
857  if ( depsSize == 0 ) outputMessageStream << " none";
858 
859  DataObjIDColl missing;
860  for ( auto d : deps ) {
861  outputMessageStream << d << " ";
862  if ( wbSlotContent.find( d ) == wbSlotContent.end() ) {
863  // outputMessageStream << "[missing] ";
864  missing.insert( d );
865  }
866  }
867 
868  outputMessageStream << std::endl;
869  }
870 
871  // Snapshot of the WhiteBoard
872  outputMessageStream << "\nWhiteboard contents: " << std::endl;
873  for ( auto& product : wbSlotContent ) outputMessageStream << " o " << product << std::endl;
874 
875  // Snapshot of the ControlFlow
876  outputMessageStream << "\nControl Flow:" << std::endl;
877  outputMessageStream << m_precSvc->printState( thisSlot ) << std::endl;
878  }
879  }
880 
881  outputMessageStream << "=================================== END ======================================" << std::endl;
882 
883  info() << "Dumping Scheduler State " << std::endl << outputMessageStream.str() << endmsg;
884 }

void AvalancheSchedulerSvc::dumpState ( )
override

Definition at line 1143 of file AvalancheSchedulerSvc.cpp.

1144 {
1145 
1146  std::lock_guard<std::mutex> lock( m_ssMut );
1147 
1148  std::ostringstream ost;
1149  ost << "dumping Executing Threads: [" << m_sState.size() << "]" << std::endl;
1150  dumpState( ost );
1151 
1152  info() << ost.str() << endmsg;
1153 }

void AvalancheSchedulerSvc::dumpState ( std::ostringstream & ost )
private

Definition at line 1132 of file AvalancheSchedulerSvc.cpp.

1133 {
1134 
1135  std::lock_guard<std::mutex> lock( m_ssMut );
1136 
1137  for ( auto it : m_sState ) {
1138  ost << " " << it << std::endl;
1139  }
1140 }

StatusCode AvalancheSchedulerSvc::eventFailed ( EventContext * eventContext )
private

Method to check if an event failed and take appropriate actions.

An event may fail.

In that case this method is called. It dumps the state of the scheduler, empties the action queue without executing the pending actions, deactivates the scheduler, pushes the failed event context into the queue of finished events, and returns a failure.

Definition at line 580 of file AvalancheSchedulerSvc.cpp.

581 {
582 
583  // Set the number of slots available to an error code
584  m_freeSlots.store( 0 );
585 
586  fatal() << "*** Event " << eventContext->evt() << " on slot " << eventContext->slot() << " failed! ***" << endmsg;
587 
588  std::ostringstream ost;
589  m_algExecStateSvc->dump( ost, *eventContext );
590 
591  info() << "Dumping Alg Exec State for slot " << eventContext->slot() << ":\n" << ost.str() << endmsg;
592 
593  dumpSchedulerState( -1 );
594 
595  // Empty queue and deactivate the service
596  action thisAction;
597  while ( m_actionsQueue.try_pop( thisAction ) ) {
598  };
599  deactivate();
600 
601  // Push into the finished events queue the failed context
602  EventContext* thisEvtContext;
603  while ( m_finishedEvents.try_pop( thisEvtContext ) ) {
604  m_finishedEvents.push( thisEvtContext );
605  };
606  m_finishedEvents.push( eventContext );
607 
608  return StatusCode::FAILURE;
609 }

StatusCode AvalancheSchedulerSvc::finalize ( )
override

Finalise.

Here the scheduler is deactivated and the thread joined.

Definition at line 332 of file AvalancheSchedulerSvc.cpp.

333 {
334 
335  StatusCode sc( Service::finalize() );
336  if ( !sc.isSuccess() ) warning() << "Base class could not be finalized" << endmsg;
337 
338  sc = deactivate();
339  if ( !sc.isSuccess() ) warning() << "Scheduler could not be deactivated" << endmsg;
340 
341  info() << "Joining Scheduler thread" << endmsg;
342  m_thread.join();
343 
344  // Final error check after thread pool termination
345  if ( m_isActive == FAILURE ) {
346  error() << "problems in scheduler thread" << endmsg;
347  return StatusCode::FAILURE;
348  }
349 
350  return sc;
351 }

unsigned int AvalancheSchedulerSvc::freeSlots ( )
override

Get the number of free slots.

Definition at line 517 of file AvalancheSchedulerSvc.cpp.

517 { return std::max( m_freeSlots.load(), 0 ); }

const std::string & AvalancheSchedulerSvc::index2algname ( unsigned int index )
inline private

Convert an integer to a name.

Definition at line 427 of file AvalancheSchedulerSvc.cpp.

427 { return m_algname_vect[index]; }

StatusCode AvalancheSchedulerSvc::initialize ( )
override

Initialise.

Here, among some "bureaucracy" operations, the scheduler is activated, executing the activate() function in a new thread.

In addition, the list of algorithms is acquired from the AlgResourcePool.
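The activation handshake described here (start the scheduler in another thread, then poll a flag until it reports success or failure) can be sketched as follows. This is a simplified illustration, not the Gaudi implementation: the ActivationState enumerators are modelled on the names that appear in the listing, and waitUntilActive with its poll-interval parameter is a hypothetical helper.

```cpp
#include <atomic>
#include <chrono>
#include <thread>

// Hypothetical mirror of the activation flag; the enumerator set is an assumption.
enum class ActivationState { INACTIVE, ACTIVE, FAILURE };

// Poll the flag until the scheduler thread reports ACTIVE (true) or FAILURE (false).
// The interval is a parameter here; the listing sleeps one second per iteration.
bool waitUntilActive( const std::atomic<ActivationState>& state,
                      std::chrono::milliseconds pollInterval ) {
  while ( state.load() != ActivationState::ACTIVE ) {
    if ( state.load() == ActivationState::FAILURE ) return false;
    std::this_thread::sleep_for( pollInterval );
  }
  return true;
}
```

In a real setting the flag would be written by the function running in the separate thread, while the initializing thread only reads it, which is why an atomic is used.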

Definition at line 61 of file AvalancheSchedulerSvc.cpp.

62 {
63 
64  // Initialise mother class (read properties, ...)
66  if ( !sc.isSuccess() ) warning() << "Base class could not be initialized" << endmsg;
67 
68  // Get hold of the TBBSvc. This should initialize the thread pool
69  m_threadPoolSvc = serviceLocator()->service( "ThreadPoolSvc" );
70  if ( !m_threadPoolSvc.isValid() ) {
71  fatal() << "Error retrieving ThreadPoolSvc" << endmsg;
72  return StatusCode::FAILURE;
73  }
74 
75  // Activate the scheduler in another thread.
76  info() << "Activating scheduler in a separate thread" << endmsg;
78 
79  while ( m_isActive != ACTIVE ) {
80  if ( m_isActive == FAILURE ) {
81  fatal() << "Terminating initialization" << endmsg;
82  return StatusCode::FAILURE;
83  } else {
84  info() << "Waiting for AvalancheSchedulerSvc to activate" << endmsg;
85  sleep( 1 );
86  }
87  }
88 
89  if ( m_enableCondSvc ) {
90  // Get hold of the CondSvc
91  m_condSvc = serviceLocator()->service( "CondSvc" );
92  if ( !m_condSvc.isValid() ) {
93  warning() << "No CondSvc found, or not enabled. "
94  << "Will not manage CondAlgorithms" << endmsg;
95  m_enableCondSvc = false;
96  }
97  }
98 
99  // Get the algo resource pool
100  m_algResourcePool = serviceLocator()->service( "AlgResourcePool" );
101  if ( !m_algResourcePool.isValid() ) {
102  fatal() << "Error retrieving AlgoResourcePool" << endmsg;
103  return StatusCode::FAILURE;
104  }
105 
106  // Get the precedence service
107  m_precSvc = serviceLocator()->service( "PrecedenceSvc" );
108  if ( !m_precSvc.isValid() ) {
109  fatal() << "Error retrieving PrecedenceSvc" << endmsg;
110  return StatusCode::FAILURE;
111  }
112  const PrecedenceSvc* precSvc = dynamic_cast<const PrecedenceSvc*>( m_precSvc.get() );
113  if ( !precSvc ) {
114  fatal() << "Unable to dcast PrecedenceSvc" << endmsg;
115  return StatusCode::FAILURE;
116  }
117 
118  m_algExecStateSvc = serviceLocator()->service( "AlgExecStateSvc" );
119  if ( !m_algExecStateSvc.isValid() ) {
120  fatal() << "Error retrieving AlgExecStateSvc" << endmsg;
121  return StatusCode::FAILURE;
122  }
123 
124  // Get Whiteboard
126  if ( !m_whiteboard.isValid() ) {
127  fatal() << "Error retrieving EventDataSvc interface IHiveWhiteBoard." << endmsg;
128  return StatusCode::FAILURE;
129  }
130 
131  // Get dedicated scheduler for I/O-bound algorithms
132  if ( m_useIOBoundAlgScheduler ) {
135  fatal() << "Error retrieving IOBoundSchedulerAlgSvc interface IAccelerator." << endmsg;
136  }
137 
138  // Set the MaxEventsInFlight parameters from the number of WB stores
140 
141  // Set the number of free slots
143 
144  // set global concurrency flags
146 
147  // Get the list of algorithms
149  const unsigned int algsNumber = algos.size();
150  info() << "Found " << algsNumber << " algorithms" << endmsg;
151 
152  /* Dependencies
153  1) Look for handles in algo, if none
154  2) Assume none are required
155  */
156 
157  DataObjIDColl globalInp, globalOutp;
158 
159  // figure out all outputs
160  for ( IAlgorithm* ialgoPtr : algos ) {
161  Algorithm* algoPtr = dynamic_cast<Algorithm*>( ialgoPtr );
162  if ( !algoPtr ) {
163  fatal() << "Could not convert IAlgorithm into Algorithm: this will result in a crash." << endmsg;
164  }
165  for ( auto id : algoPtr->outputDataObjs() ) {
166  auto r = globalOutp.insert( id );
167  if ( !r.second ) {
168  warning() << "multiple algorithms declare " << id << " as output! could be a single instance in multiple paths "
169  "though, or control flow may guarantee only one runs...!"
170  << endmsg;
171  }
172  }
173  }
174 
175  std::ostringstream ostdd;
176  ostdd << "Data Dependencies for Algorithms:";
177 
178  std::vector<DataObjIDColl> m_algosDependencies;
179  for ( IAlgorithm* ialgoPtr : algos ) {
180  Algorithm* algoPtr = dynamic_cast<Algorithm*>( ialgoPtr );
181  if ( nullptr == algoPtr ) {
182  fatal() << "Could not convert IAlgorithm into Algorithm for " << ialgoPtr->name()
183  << ": this will result in a crash." << endmsg;
184  return StatusCode::FAILURE;
185  }
186 
187  ostdd << "\n " << algoPtr->name();
188 
189  DataObjIDColl algoDependencies;
190  if ( !algoPtr->inputDataObjs().empty() || !algoPtr->outputDataObjs().empty() ) {
191  for ( const DataObjID* idp : sortedDataObjIDColl( algoPtr->inputDataObjs() ) ) {
192  DataObjID id = *idp;
193  ostdd << "\n o INPUT " << id;
194  if ( id.key().find( ":" ) != std::string::npos ) {
195  ostdd << " contains alternatives which require resolution...\n";
196  auto tokens = boost::tokenizer<boost::char_separator<char>>{id.key(), boost::char_separator<char>{":"}};
197  auto itok = std::find_if( tokens.begin(), tokens.end(), [&]( const std::string& t ) {
198  return globalOutp.find( DataObjID{t} ) != globalOutp.end();
199  } );
200  if ( itok != tokens.end() ) {
201  ostdd << "found matching output for " << *itok << " -- updating scheduler info\n";
202  id.updateKey( *itok );
203  } else {
204  error() << "failed to find alternate in global output list"
205  << " for id: " << id << " in Alg " << algoPtr->name() << endmsg;
206  m_showDataDeps = true;
207  }
208  }
209  algoDependencies.insert( id );
210  globalInp.insert( id );
211  }
212  for ( const DataObjID* id : sortedDataObjIDColl( algoPtr->outputDataObjs() ) ) {
213  ostdd << "\n o OUTPUT " << *id;
214  if ( id->key().find( ":" ) != std::string::npos ) {
215  error() << " in Alg " << algoPtr->name() << " alternatives are NOT allowed for outputs! id: " << *id
216  << endmsg;
217  m_showDataDeps = true;
218  }
219  }
220  } else {
221  ostdd << "\n none";
222  }
223  m_algosDependencies.emplace_back( algoDependencies );
224  }
225 
226  if ( m_showDataDeps ) {
227  info() << ostdd.str() << endmsg;
228  }
229 
230  // Fill the containers to convert algo names to index
231  m_algname_vect.resize( algsNumber );
232  IAlgorithm* dataLoaderAlg( nullptr );
233  for ( IAlgorithm* algo : algos ) {
234  const std::string& name = algo->name();
235  auto index = precSvc->getRules()->getAlgorithmNode( name )->getAlgoIndex();
236  m_algname_index_map[name] = index;
237  m_algname_vect.at( index ) = name;
238  if ( algo->name() == m_useDataLoader ) {
239  dataLoaderAlg = algo;
240  }
241  }
242 
243  // Check if we have unmet global input dependencies
244  if ( m_checkDeps ) {
245  DataObjIDColl unmetDep;
246  for ( auto o : globalInp ) {
247  if ( globalOutp.find( o ) == globalOutp.end() ) {
248  unmetDep.insert( o );
249  }
250  }
251 
252  if ( unmetDep.size() > 0 ) {
253 
254  std::ostringstream ost;
255  for ( const DataObjID* o : sortedDataObjIDColl( unmetDep ) ) {
256  ost << "\n o " << *o << " required by Algorithm: ";
257  for ( size_t i = 0; i < m_algosDependencies.size(); ++i ) {
258  if ( m_algosDependencies[i].find( *o ) != m_algosDependencies[i].end() ) {
259  ost << "\n * " << m_algname_vect[i];
260  }
261  }
262  }
263 
264  if ( m_useDataLoader != "" ) {
265  // Find the DataLoader Alg
266  if ( dataLoaderAlg == nullptr ) {
267  fatal() << "No DataLoader Algorithm \"" << m_useDataLoader.value()
268  << "\" found, and unmet INPUT dependencies "
269  << "detected:\n"
270  << ost.str() << endmsg;
271  return StatusCode::FAILURE;
272  }
273 
274  info() << "Will attribute the following unmet INPUT dependencies to \"" << dataLoaderAlg->type() << "/"
275  << dataLoaderAlg->name() << "\" Algorithm" << ost.str() << endmsg;
276 
277  // Set the property Load of DataLoader Alg
278  Algorithm* dataAlg = dynamic_cast<Algorithm*>( dataLoaderAlg );
279  if ( !dataAlg ) {
280  fatal() << "Unable to dcast DataLoader \"" << m_useDataLoader.value() << "\" IAlg to Algorithm" << endmsg;
281  return StatusCode::FAILURE;
282  }
283 
284  for ( auto& id : unmetDep ) {
285  debug() << "adding OUTPUT dep \"" << id << "\" to " << dataLoaderAlg->type() << "/" << dataLoaderAlg->name()
286  << endmsg;
288  }
289 
290  } else {
291  fatal() << "Auto DataLoading not requested, "
292  << "and the following unmet INPUT dependencies were found:" << ost.str() << endmsg;
293  return StatusCode::FAILURE;
294  }
295 
296  } else {
297  info() << "No unmet INPUT data dependencies were found" << endmsg;
298  }
299  }
300 
301  // Shortcut for the message service
302  SmartIF<IMessageSvc> messageSvc( serviceLocator() );
303  if ( !messageSvc.isValid() ) error() << "Error retrieving MessageSvc interface IMessageSvc." << endmsg;
304 
305  m_eventSlots.assign( m_maxEventsInFlight, EventSlot( m_algosDependencies, algsNumber,
306  precSvc->getRules()->getControlFlowNodeCounter(), messageSvc ) );
307  std::for_each( m_eventSlots.begin(), m_eventSlots.end(), []( EventSlot& slot ) { slot.complete = true; } );
308 
309  if ( m_threadPoolSize > 1 ) {
311  }
312 
313  // Clearly inform about the level of concurrency
314  info() << "Concurrency level information:" << endmsg;
315  info() << " o Number of events in flight: " << m_maxEventsInFlight << endmsg;
316  info() << " o TBB thread pool size: " << m_threadPoolSize << endmsg;
317 
319 
321 
322  // Simulate execution flow
324 
325  return sc;
326 }
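The handling of input keys that contain ':' in the listing above can be sketched on plain strings: the key is split into alternatives and the first one that some algorithm declares as output is selected. This is a minimal illustration, assuming keys are std::string values and the produced outputs are held in a std::set; resolveAlternative is a hypothetical name, and std::getline stands in for the boost::tokenizer used in the listing.

```cpp
#include <set>
#include <sstream>
#include <string>

// Hypothetical helper: return the first ':'-separated alternative found in 'produced',
// or an empty string when none of the alternatives is produced by any algorithm.
std::string resolveAlternative( const std::string& key, const std::set<std::string>& produced ) {
  std::istringstream in( key );
  std::string token;
  while ( std::getline( in, token, ':' ) ) {
    // Skip empty tokens (e.g. from "a::b") and stop at the first match
    if ( !token.empty() && produced.count( token ) != 0 ) return token;
  }
  return std::string();
}
```

Returning the first match means the order of alternatives in the key acts as a priority order, which matches the std::find_if call in the listing.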
const concurrency::PrecedenceRulesGraph * getRules() const
Precedence rules accessor.
Definition: PrecedenceSvc.h:68
Gaudi::Property< bool > m_showDataFlow
StatusCode initialize() override
Definition: Service.cpp:64
const unsigned int & getAlgoIndex() const
Get algorithm index.
Gaudi::Property< std::string > m_whiteboardSvcName
SmartIF< IPrecedenceSvc > m_precSvc
A shortcut to the Precedence Service.
const std::string & name() const override
The identifying name of the algorithm object.
Definition: Algorithm.cpp:731
const std::string & name() const override
Retrieve name of the service.
Definition: Service.cpp:289
Gaudi::Property< bool > m_showDataDeps
MsgStream & info() const
shortcut for the method msgStream(MSG::INFO)
const DataObjIDColl & outputDataObjs() const override
A service to resolve the task execution precedence.
Definition: PrecedenceSvc.h:21
void activate()
Activate scheduler.
Gaudi::Property< std::string > m_useDataLoader
AlgorithmNode * getAlgorithmNode(const std::string &algoName) const
Get the AlgorithmNode by algorithm name using the graph index.
virtual std::list< IAlgorithm * > getFlatAlgList()=0
Get the flat list of algorithms.
virtual StatusCode simulate(EventSlot &) const =0
Simulate execution flow.
std::atomic< ActivationState > m_isActive
Flag to track if the scheduler is active or not.
std::unordered_map< std::string, unsigned int > m_algname_index_map
Map to bookkeep the information necessary to the name2index conversion.
Gaudi::Property< bool > m_checkDeps
Gaudi::Property< bool > m_useIOBoundAlgScheduler
TYPE * get() const
Get interface pointer.
Definition: SmartIF.h:82
std::atomic_int m_freeSlots
Atomic to account for asynchronous updates by the scheduler with respect to the rest.
virtual void dumpDataFlow() const =0
StatusCode service(const Gaudi::Utils::TypeNameString &name, T *&svc, bool createIf=true)
Templated method to access a service by name.
Definition: ISvcLocator.h:79
MsgStream & error() const
shortcut for the method msgStream(MSG::ERROR)
SmartIF< IAlgResourcePool > m_algResourcePool
Cache for the algorithm resource pool.
MsgStream & warning() const
shortcut for the method msgStream(MSG::WARNING)
Gaudi::Property< bool > m_showControlFlow
This class is used for returning status codes from appropriate routines.
Definition: StatusCode.h:26
SmartIF< IHiveWhiteBoard > m_whiteboard
A shortcut to the whiteboard.
const DataObjIDColl & inputDataObjs() const override
std::vector< std::string > m_algname_vect
Vector to bookkeep the information necessary to the index2name conversion.
Gaudi::Property< std::string > m_IOBoundAlgSchedulerSvcName
virtual void dumpControlFlow() const =0
Dump precedence rules.
bool complete
Flags completion of the event.
Definition: EventSlot.h:39
Gaudi::Property< int > m_threadPoolSize
SmartIF< IThreadPoolSvc > m_threadPoolSvc
SmartIF< IAccelerator > m_IOBoundAlgScheduler
A shortcut to IO-bound algorithm scheduler.
The IAlgorithm is the interface implemented by the Algorithm base class.
Definition: IAlgorithm.h:28
void addDependency(const DataObjID &id, const Gaudi::DataHandle::Mode &mode) override
Base class from which all concrete algorithm classes should be derived.
Definition: Algorithm.h:79
Gaudi::Property< bool > m_simulateExecution
MsgStream & debug() const
shortcut for the method msgStream(MSG::DEBUG)
bool isValid() const
Allow for check if smart pointer is valid.
Definition: SmartIF.h:68
Gaudi::Property< bool > m_enableCondSvc
Class representing the event slot.
Definition: EventSlot.h:11
SmartIF< ICondSvc > m_condSvc
A shortcut to service for Conditions handling.
std::vector< EventSlot > m_eventSlots
Vector of event slots.
MsgStream & fatal() const
shortcut for the method msgStream(MSG::FATAL)
virtual size_t getNumberOfStores() const =0
Get the number of 'slots'.
SmartIF< ISvcLocator > & serviceLocator() const override
Retrieve pointer to service locator.
Definition: Service.cpp:292
SmartIF< IAlgExecStateSvc > m_algExecStateSvc
Algorithm execution state manager.
MsgStream & endmsg(MsgStream &s)
MsgStream Modifier: endmsg. Calls the output method of the MsgStream.
Definition: MsgStream.h:209
static GAUDI_API void setNumConcEvents(const std::size_t &nE)
unsigned int getControlFlowNodeCounter() const
Get total number of control flow graph nodes.
std::thread m_thread
The thread in which the activate function runs.
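The unmet-dependency check in initialize() amounts to a set difference: every declared input that no algorithm declares as output is unmet. A minimal sketch, assuming data identifiers are plain strings held in std::set rather than DataObjID objects in a DataObjIDColl; unmetDependencies is a hypothetical name.

```cpp
#include <set>
#include <string>

// Hypothetical helper: collect every input that does not appear among the outputs.
std::set<std::string> unmetDependencies( const std::set<std::string>& inputs,
                                         const std::set<std::string>& outputs ) {
  std::set<std::string> unmet;
  for ( const auto& in : inputs ) {
    if ( outputs.find( in ) == outputs.end() ) unmet.insert( in );
  }
  return unmet;
}
```

In the listing, a non-empty result is either attributed to the configured DataLoader algorithm or treated as a fatal configuration error when no DataLoader is requested.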
StatusCode AvalancheSchedulerSvc::isStalled ( int  iSlot)
private

Check if the scheduling is in a stall.

Check whether a stall condition is present for a particular slot.

This is the case when no actions are present in the actionsQueue, no algorithm is in flight and no algorithm has all of its dependencies satisfied.

Definition at line 798 of file AvalancheSchedulerSvc.cpp.

799 {
800  // Get the slot
801  EventSlot& thisSlot = m_eventSlots[iSlot];
802 
803  if ( m_actionsQueue.empty() && m_algosInFlight == 0 && m_IOBoundAlgosInFlight == 0 &&
805 
806  info() << "About to declare a stall" << endmsg;
807  fatal() << "*** Stall detected! ***\n" << endmsg;
808  dumpSchedulerState( iSlot );
809  // throw GaudiException ("Stall detected",name(),StatusCode::FAILURE);
810 
811  return StatusCode::FAILURE;
812  }
813  return StatusCode::SUCCESS;
814 }
bool algsPresent(State state) const
unsigned int m_IOBoundAlgosInFlight
Number of algorithms presently in flight.
MsgStream & info() const
shortcut for the method msgStream(MSG::INFO)
void dumpSchedulerState(int iSlot)
Dump the state of the scheduler.
AlgsExecutionStates algsStates
Vector of algorithms states.
Definition: EventSlot.h:37
unsigned int m_algosInFlight
Number of algorithms presently in flight.
Class representing the event slot.
Definition: EventSlot.h:11
std::vector< EventSlot > m_eventSlots
Vector of event slots.
MsgStream & fatal() const
shortcut for the method msgStream(MSG::FATAL)
MsgStream & endmsg(MsgStream &s)
MsgStream Modifier: endmsg. Calls the output method of the MsgStream.
Definition: MsgStream.h:209
tbb::concurrent_bounded_queue< action > m_actionsQueue
Queue where closures are stored and picked for execution.
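The stall condition described for isStalled() is a conjunction of four facts: nothing is queued, nothing is running (neither ordinary nor I/O-bound algorithms), and no algorithm is ready to run. The sketch below expresses that predicate on a hypothetical snapshot structure; the field names are illustrative and do not correspond to Gaudi members.

```cpp
#include <cstddef>

// Hypothetical snapshot of the quantities named in the description (not Gaudi types).
struct SchedulerSnapshot {
  std::size_t queuedActions;    // closures waiting in the actions queue
  unsigned int algosInFlight;   // ordinary algorithms currently running
  unsigned int ioAlgosInFlight; // I/O-bound algorithms currently running
  bool anyAlgDataReady;         // true if some algorithm has all of its dependencies satisfied
};

// A stall means no progress is possible: nothing queued, nothing running, nothing ready.
bool isStalled( const SchedulerSnapshot& s ) {
  return s.queuedActions == 0 && s.algosInFlight == 0 && s.ioAlgosInFlight == 0 && !s.anyAlgDataReady;
}
```

If any one of the four terms is false, some future event (a queued action, a finishing algorithm, or a schedulable algorithm) can still change the slot's state, so the slot is not stalled.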
StatusCode AvalancheSchedulerSvc::m_drain ( )
private

Drain the actions present in the queue.

Update the states for all slots until nothing is left to do.

Definition at line 523 of file AvalancheSchedulerSvc.cpp.

524 {
525 
526  unsigned int slotNum = 0;
527  for ( auto& thisSlot : m_eventSlots ) {
528  if ( not thisSlot.algsStates.allAlgsExecuted() and not thisSlot.complete ) {
529  updateStates( slotNum );
530  }
531  slotNum++;
532  }
533  return StatusCode::SUCCESS;
534 }
std::vector< EventSlot > m_eventSlots
Vector of event slots.
StatusCode updateStates(int si=-1, const std::string &algo_name=std::string())
Loop on algorithm in the slots and promote them to successive states (-1 means all slots...
StatusCode AvalancheSchedulerSvc::popFinishedEvent ( EventContext *&  eventContext)
override

Blocks until an event is available.

Get a finished event or block until one becomes available.

Definition at line 540 of file AvalancheSchedulerSvc.cpp.

541 {
542  // debug() << "popFinishedEvent: queue size: " << m_finishedEvents.size() << endmsg;
543  if ( m_freeSlots.load() == (int)m_maxEventsInFlight or m_isActive == INACTIVE ) {
544  // debug() << "freeslots: " << m_freeSlots << "/" << m_maxEventsInFlight
545  // << " active: " << m_isActive << endmsg;
546  return StatusCode::FAILURE;
547  } else {
548  // debug() << "freeslots: " << m_freeSlots << "/" << m_maxEventsInFlight
549  // << " active: " << m_isActive << endmsg;
550  m_finishedEvents.pop( eventContext );
551  m_freeSlots++;
552  if ( msgLevel( MSG::DEBUG ) )
553  debug() << "Popped slot " << eventContext->slot() << "(event " << eventContext->evt() << ")" << endmsg;
554  return StatusCode::SUCCESS;
555  }
556 }
ContextID_t slot() const
Definition: EventContext.h:40
std::atomic< ActivationState > m_isActive
Flag to track if the scheduler is active or not.
ContextEvt_t evt() const
Definition: EventContext.h:39
std::atomic_int m_freeSlots
Atomic to account for asynchronous updates by the scheduler with respect to the rest.
tbb::concurrent_bounded_queue< EventContext * > m_finishedEvents
Queue of finished events.
MsgStream & debug() const
shortcut for the method msgStream(MSG::DEBUG)
MSG::Level msgLevel() const
get the cached level (originally extracted from the embedded MsgStream)
MsgStream & endmsg(MsgStream &s)
MsgStream Modifier: endmsg. Calls the output method of the MsgStream.
Definition: MsgStream.h:209
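The guard at the top of popFinishedEvent() matters because the pop itself blocks: if every slot is free, no event is in flight and a blocking pop would never return. The sketch below shows the same bookkeeping in a single-threaded, non-blocking miniature. FinishedEventBuffer is a hypothetical type, events are reduced to integer identifiers, and the early return on an empty queue replaces the blocking wait of the real queue.

```cpp
#include <queue>

// Hypothetical, single-threaded miniature of the bookkeeping around finished events.
struct FinishedEventBuffer {
  int freeSlots;            // slots not currently holding an event
  int maxEventsInFlight;    // total number of slots
  bool active;              // scheduler activation flag
  std::queue<int> finished; // identifiers of finished events

  explicit FinishedEventBuffer( int slots ) : freeSlots( slots ), maxEventsInFlight( slots ), active( true ) {}

  // Returns false when there is nothing to wait for; otherwise hands out one event.
  bool pop( int& eventId ) {
    if ( freeSlots == maxEventsInFlight || !active ) return false; // no event in flight
    if ( finished.empty() ) return false; // the blocking queue in the listing would wait here
    eventId = finished.front();
    finished.pop();
    ++freeSlots; // the slot becomes reusable
    return true;
  }
};
```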
StatusCode AvalancheSchedulerSvc::promoteToAsyncExecuted ( unsigned int  iAlgo,
int  si,
IAlgorithm *  algo,
EventContext *  eventContext 
)
private

The call to this method is triggered only from within the IOBoundAlgTask.

Definition at line 1054 of file AvalancheSchedulerSvc.cpp.

1056 {
1057  // Put back the instance
1058  Algorithm* castedAlgo = dynamic_cast<Algorithm*>( algo ); // DP: expose context getter in IAlgo?
1059  if ( !castedAlgo ) fatal() << "[Asynchronous] The casting did not succeed!" << endmsg;
1060  // EventContext* eventContext = castedAlgo->getContext();
1061 
1062  // Check if the execution failed
1063  if ( m_algExecStateSvc->eventStatus( *eventContext ) != EventStatus::Success ) eventFailed( eventContext ).ignore();
1064 
1065  StatusCode sc = m_algResourcePool->releaseAlgorithm( algo->name(), algo );
1066 
1067  if ( !sc.isSuccess() ) {
1068  error() << "[Asynchronous] [Event " << eventContext->evt() << ", Slot " << eventContext->slot() << "] "
1069  << "Instance of algorithm " << algo->name() << " could not be properly put back." << endmsg;
1070  return StatusCode::FAILURE;
1071  }
1072 
1074 
1075  EventSlot& thisSlot = m_eventSlots[si];
1076 
1077  if ( msgLevel( MSG::DEBUG ) )
1078  debug() << "[Asynchronous] Algorithm " << algo->name() << " executed in slot " << si
1079  << ". Algorithms scheduled are " << m_IOBoundAlgosInFlight << endmsg;
1080 
1081  // Schedule an update of the status of the algorithms
1082  auto updateAction = std::bind( &AvalancheSchedulerSvc::updateStates, this, -1, algo->name() );
1083  m_actionsQueue.push( updateAction );
1084  m_updateNeeded = false;
1085 
1086  if ( msgLevel( MSG::DEBUG ) )
1087  debug() << "[Asynchronous] Trying to handle execution result of " << index2algname( iAlgo ) << " on slot " << si
1088  << endmsg;
1089  State state;
1090  if ( algo->filterPassed() ) {
1091  state = State::EVTACCEPTED;
1092  } else {
1093  state = State::EVTREJECTED;
1094  }
1095 
1096  sc = thisSlot.algsStates.updateState( iAlgo, state );
1097 
1098  if ( sc.isSuccess() )
1099  if ( msgLevel( MSG::VERBOSE ) )
1100  verbose() << "[Asynchronous] Promoting " << index2algname( iAlgo ) << " on slot " << si << " to "
1102 
1103  return sc;
1104 }
unsigned int m_IOBoundAlgosInFlight
Number of algorithms presently in flight.
ContextID_t slot() const
Definition: EventContext.h:40
AlgsExecutionStates algsStates
Vector of algorithms states.
Definition: EventSlot.h:37
bool isSuccess() const
Test for a status code of SUCCESS.
Definition: StatusCode.h:50
MsgStream & verbose() const
shortcut for the method msgStream(MSG::VERBOSE)
virtual bool filterPassed() const =0
Did this algorithm pass or fail its filter criterion for the last event?
ContextEvt_t evt() const
Definition: EventContext.h:39
MsgStream & error() const
shortcut for the method msgStream(MSG::ERROR)
SmartIF< IAlgResourcePool > m_algResourcePool
Cache for the algorithm resource pool.
This class is used for returning status codes from appropriate routines.
Definition: StatusCode.h:26
bool m_updateNeeded
Keep track of update actions scheduled.
Base class from which all concrete algorithm classes should be derived.
Definition: Algorithm.h:79
MsgStream & debug() const
shortcut for the method msgStream(MSG::DEBUG)
StatusCode eventFailed(EventContext *eventContext)
Method to check if an event failed and take appropriate actions.
virtual const EventStatus::Status & eventStatus(const EventContext &ctx) const =0
const std::string & index2algname(unsigned int index)
Convert an integer to a name.
Class representing the event slot.
Definition: EventSlot.h:11
std::vector< EventSlot > m_eventSlots
Vector of event slots.
void ignore() const
Definition: StatusCode.h:84
MsgStream & fatal() const
shortcut for the method msgStream(MSG::FATAL)
StatusCode updateStates(int si=-1, const std::string &algo_name=std::string())
Loop on algorithm in the slots and promote them to successive states (-1 means all slots...
MSG::Level msgLevel() const
get the cached level (originally extracted from the embedded MsgStream)
State
Execution states of the algorithms.
virtual StatusCode releaseAlgorithm(const std::string &name, IAlgorithm *&algo)=0
Release a certain algorithm.
SmartIF< IAlgExecStateSvc > m_algExecStateSvc
Algorithm execution state manager.
MsgStream & endmsg(MsgStream &s)
MsgStream Modifier: endmsg. Calls the output method of the MsgStream.
Definition: MsgStream.h:209
tbb::concurrent_bounded_queue< action > m_actionsQueue
Queue where closures are stored and picked for execution.
static std::map< State, std::string > stateNames
StatusCode updateState(unsigned int iAlgo, State newState)
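The "schedule an update" step in the listing follows a common pattern: the code that finishes an algorithm does not update scheduler state itself, it binds a member-function call into a closure and pushes it onto a queue that the scheduler thread consumes. The sketch below reproduces that pattern in a single-threaded miniature. MiniScheduler, notifyExecuted, drain and log are hypothetical names, a plain std::queue replaces the concurrent queue, and bool replaces StatusCode.

```cpp
#include <functional>
#include <queue>
#include <string>
#include <vector>

// Hypothetical miniature of the closure-queue pattern (not the Gaudi scheduler).
class MiniScheduler {
public:
  using action = std::function<bool()>;

  // Stand-in for updateStates( si, algo_name ): records what it was asked to update
  bool updateStates( int slot, const std::string& algoName ) {
    m_log.push_back( algoName + "@" + std::to_string( slot ) );
    return true;
  }

  // What a finished algorithm would trigger: bind the member call and queue it
  void notifyExecuted( const std::string& algoName ) {
    m_actions.push( std::bind( &MiniScheduler::updateStates, this, -1, algoName ) );
  }

  // What the scheduler thread would do: run the queued closures in FIFO order
  void drain() {
    while ( !m_actions.empty() ) {
      m_actions.front()();
      m_actions.pop();
    }
  }

  const std::vector<std::string>& log() const { return m_log; }

private:
  std::queue<action> m_actions;
  std::vector<std::string> m_log;
};
```

Because std::bind copies its arguments, the algorithm name stored in the closure stays valid even if the caller's string is destroyed before the closure runs.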
StatusCode AvalancheSchedulerSvc::promoteToAsyncScheduled ( unsigned int  iAlgo,
int  si 
)
private

Definition at line 950 of file AvalancheSchedulerSvc.cpp.

951 {
952 
954 
955  // bool IOBound = m_precSvc->isBlocking(algName);
956 
957  const std::string& algName( index2algname( iAlgo ) );
958  IAlgorithm* ialgoPtr = nullptr;
959  StatusCode sc( m_algResourcePool->acquireAlgorithm( algName, ialgoPtr ) );
960 
961  if ( sc.isSuccess() ) { // if we managed to get an algorithm instance try to schedule it
962  EventContext* eventContext( m_eventSlots[si].eventContext );
963  if ( !eventContext ) {
964  fatal() << "[Asynchronous] Event context for algorithm " << algName << " is a nullptr (slot " << si << ")"
965  << endmsg;
966  return StatusCode::FAILURE;
967  }
968 
970  // Can we use tbb-based overloaded new-operator for a "custom" task (an algorithm wrapper, not derived from
971  // tbb::task)? it seems it works..
972  IOBoundAlgTask* theTask = new ( tbb::task::allocate_root() )
973  IOBoundAlgTask( ialgoPtr, eventContext, serviceLocator(), m_algExecStateSvc );
974  m_IOBoundAlgScheduler->push( *theTask );
975 
976  if ( msgLevel( MSG::DEBUG ) )
977  debug() << "[Asynchronous] Algorithm " << algName << " was submitted on event " << eventContext->evt()
978  << " in slot " << si << ". algorithms scheduled are " << m_IOBoundAlgosInFlight << endmsg;
979 
980  StatusCode updateSc( m_eventSlots[si].algsStates.updateState( iAlgo, AlgsExecutionStates::SCHEDULED ) );
981 
982  if ( updateSc.isSuccess() )
983  if ( msgLevel( MSG::VERBOSE ) )
984  verbose() << "[Asynchronous] Promoting " << index2algname( iAlgo ) << " to SCHEDULED on slot " << si << endmsg;
985  return updateSc;
986  } else {
987  if ( msgLevel( MSG::DEBUG ) )
988  debug() << "[Asynchronous] Could not acquire instance for algorithm " << index2algname( iAlgo ) << " on slot "
989  << si << endmsg;
990  return sc;
991  }
992 }
Wrapper around I/O-bound Gaudi-algorithms.
virtual StatusCode acquireAlgorithm(const std::string &name, IAlgorithm *&algo, bool blocking=false)=0
Acquire a certain algorithm using its name.
unsigned int m_IOBoundAlgosInFlight
Number of algorithms presently in flight.
MsgStream & verbose() const
shortcut for the method msgStream(MSG::VERBOSE)
This class represents an entry point to all the event specific data.
Definition: EventContext.h:24
SmartIF< IAlgResourcePool > m_algResourcePool
Cache for the algorithm resource pool.
This class is used for returning status codes from appropriate routines.
Definition: StatusCode.h:26
SmartIF< IAccelerator > m_IOBoundAlgScheduler
A shortcut to IO-bound algorithm scheduler.
The IAlgorithm is the interface implemented by the Algorithm base class.
Definition: IAlgorithm.h:28
MsgStream & debug() const
shortcut for the method msgStream(MSG::DEBUG)
const std::string & index2algname(unsigned int index)
Convert an integer to a name.
std::vector< EventSlot > m_eventSlots
Vector of event slots.
virtual StatusCode push(IAlgTask &task)=0
MsgStream & fatal() const
shortcut for the method msgStream(MSG::FATAL)
MSG::Level msgLevel() const
get the cached level (originally extracted from the embedded MsgStream)
SmartIF< ISvcLocator > & serviceLocator() const override
Retrieve pointer to service locator.
Definition: Service.cpp:292
SmartIF< IAlgExecStateSvc > m_algExecStateSvc
Algorithm execution state manager.
Gaudi::Property< unsigned int > m_maxIOBoundAlgosInFlight
MsgStream & endmsg(MsgStream &s)
MsgStream Modifier: endmsg. Calls the output method of the MsgStream.
Definition: MsgStream.h:209
StatusCode AvalancheSchedulerSvc::promoteToExecuted ( unsigned int  iAlgo,
int  si,
IAlgorithm *  algo,
EventContext *  eventContext 
)
private

The call to this method is triggered only from within the AlgoExecutionTask.

Definition at line 998 of file AvalancheSchedulerSvc.cpp.

1000 {
1001  // Put back the instance
1002  Algorithm* castedAlgo = dynamic_cast<Algorithm*>( algo ); // DP: expose context getter in IAlgo?
1003  if ( !castedAlgo ) fatal() << "The casting did not succeed!" << endmsg;
1004  // EventContext* eventContext = castedAlgo->getContext();
1005 
1006  // Check if the execution failed
1007  if ( m_algExecStateSvc->eventStatus( *eventContext ) != EventStatus::Success ) eventFailed( eventContext ).ignore();
1008 
1009  Gaudi::Hive::setCurrentContext( eventContext );
1010  StatusCode sc = m_algResourcePool->releaseAlgorithm( algo->name(), algo );
1011 
1012  if ( !sc.isSuccess() ) {
1013  error() << "[Event " << eventContext->evt() << ", Slot " << eventContext->slot() << "] "
1014  << "Instance of algorithm " << algo->name() << " could not be properly put back." << endmsg;
1015  return StatusCode::FAILURE;
1016  }
1017 
1018  m_algosInFlight--;
1019 
1020  EventSlot& thisSlot = m_eventSlots[si];
1021 
1022  if ( msgLevel( MSG::DEBUG ) )
1023  debug() << "Algorithm " << algo->name() << " executed in slot " << si << ". Algorithms scheduled are "
1024  << m_algosInFlight << endmsg;
1025 
1026  // Schedule an update of the status of the algorithms
1027  auto updateAction = std::bind( &AvalancheSchedulerSvc::updateStates, this, -1, algo->name() );
1028  m_actionsQueue.push( updateAction );
1029  m_updateNeeded = false;
1030 
1031  if ( msgLevel( MSG::DEBUG ) )
1032  debug() << "Trying to handle execution result of " << index2algname( iAlgo ) << " on slot " << si << endmsg;
1033  State state;
1034  if ( algo->filterPassed() ) {
1035  state = State::EVTACCEPTED;
1036  } else {
1037  state = State::EVTREJECTED;
1038  }
1039 
1040  sc = thisSlot.algsStates.updateState( iAlgo, state );
1041 
1042  if ( sc.isSuccess() )
1043  if ( msgLevel( MSG::VERBOSE ) )
1044  verbose() << "Promoting " << index2algname( iAlgo ) << " on slot " << si << " to "
1046 
1047  return sc;
1048 }
ContextID_t slot() const
Definition: EventContext.h:40
AlgsExecutionStates algsStates
Vector of algorithms states.
Definition: EventSlot.h:37
bool isSuccess() const
Test for a status code of SUCCESS.
Definition: StatusCode.h:50
MsgStream & verbose() const
shortcut for the method msgStream(MSG::VERBOSE)
virtual bool filterPassed() const =0
Did this algorithm pass or fail its filter criterion for the last event?
ContextEvt_t evt() const
Definition: EventContext.h:39
unsigned int m_algosInFlight
Number of algorithms presently in flight.
MsgStream & error() const
shortcut for the method msgStream(MSG::ERROR)
SmartIF< IAlgResourcePool > m_algResourcePool
Cache for the algorithm resource pool.
This class is used for returning status codes from appropriate routines.
Definition: StatusCode.h:26
GAUDI_API void setCurrentContext(const EventContext *ctx)
bool m_updateNeeded
Keep track of update actions scheduled.
Base class from which all concrete algorithm classes should be derived.
Definition: Algorithm.h:79
MsgStream & debug() const
shortcut for the method msgStream(MSG::DEBUG)
StatusCode eventFailed(EventContext *eventContext)
Method to check if an event failed and take appropriate actions.
virtual const EventStatus::Status & eventStatus(const EventContext &ctx) const =0
const std::string & index2algname(unsigned int index)
Convert an integer to a name.
Class representing the event slot.
Definition: EventSlot.h:11
std::vector< EventSlot > m_eventSlots
Vector of event slots.
void ignore() const
Definition: StatusCode.h:84
MsgStream & fatal() const
shortcut for the method msgStream(MSG::FATAL)
StatusCode updateStates(int si=-1, const std::string &algo_name=std::string())
Loop on algorithm in the slots and promote them to successive states (-1 means all slots...
MSG::Level msgLevel() const
get the cached level (originally extracted from the embedded MsgStream)
State
Execution states of the algorithms.
virtual StatusCode releaseAlgorithm(const std::string &name, IAlgorithm *&algo)=0
Release a certain algorithm.
SmartIF< IAlgExecStateSvc > m_algExecStateSvc
Algorithm execution state manager.
MsgStream & endmsg(MsgStream &s)
MsgStream Modifier: endmsg. Calls the output method of the MsgStream.
Definition: MsgStream.h:209
tbb::concurrent_bounded_queue< action > m_actionsQueue
Queue where closures are stored and picked for execution.
static std::map< State, std::string > stateNames
StatusCode updateState(unsigned int iAlgo, State newState)
StatusCode AvalancheSchedulerSvc::promoteToFinished ( unsigned int  iAlgo,
int  si 
)
private
StatusCode AvalancheSchedulerSvc::promoteToScheduled ( unsigned int  iAlgo,
int  si 
)
private

Algorithm promotion.

Definition at line 888 of file AvalancheSchedulerSvc.cpp.

889 {
890 
892 
893  const std::string& algName( index2algname( iAlgo ) );
894  IAlgorithm* ialgoPtr = nullptr;
895  StatusCode sc( m_algResourcePool->acquireAlgorithm( algName, ialgoPtr ) );
896 
897  if ( sc.isSuccess() ) { // if we managed to get an algorithm instance try to schedule it
898  EventContext* eventContext( m_eventSlots[si].eventContext );
899  if ( !eventContext ) {
900  fatal() << "Event context for algorithm " << algName << " is a nullptr (slot " << si << ")" << endmsg;
901  return StatusCode::FAILURE;
902  }
903 
904  ++m_algosInFlight;
905  auto promote2ExecutedClosure = std::bind( &AvalancheSchedulerSvc::promoteToExecuted, this, iAlgo,
906  eventContext->slot(), ialgoPtr, eventContext );
907  // Avoid to use tbb if the pool size is 1 and run in this thread
908  if ( -100 != m_threadPoolSize ) {
909 
910  // this parent task is needed to promote an Algorithm as EXECUTED,
911  // it will be started as soon as the child task (see below) is completed
912  tbb::task* triggerAlgoStateUpdate =
913  new ( tbb::task::allocate_root() ) enqueueSchedulerActionTask( this, promote2ExecutedClosure );
914  // setting parent's refcount to 1 is made here only for consistency
915  // (in this case since it is not scheduled explicitly and there it has only one child task)
916  triggerAlgoStateUpdate->set_ref_count( 1 );
917  // the child task that executes an Algorithm
918  tbb::task* algoTask = new ( triggerAlgoStateUpdate->allocate_child() )
919  AlgoExecutionTask( ialgoPtr, eventContext, serviceLocator(), m_algExecStateSvc );
920  // schedule the algoTask
921  tbb::task::enqueue( *algoTask );
922 
923  } else {
924  AlgoExecutionTask theTask( ialgoPtr, eventContext, serviceLocator(), m_algExecStateSvc );
925  theTask.execute();
926  promote2ExecutedClosure();
927  }
928 
929  if ( msgLevel( MSG::DEBUG ) )
930  debug() << "Algorithm " << algName << " was submitted on event " << eventContext->evt() << " in slot " << si
931  << ". Algorithms scheduled are " << m_algosInFlight << endmsg;
932 
933  StatusCode updateSc( m_eventSlots[si].algsStates.updateState( iAlgo, AlgsExecutionStates::SCHEDULED ) );
934 
935  if ( msgLevel( MSG::VERBOSE ) ) dumpSchedulerState( -1 );
936 
937  if ( updateSc.isSuccess() )
938  if ( msgLevel( MSG::VERBOSE ) )
939  verbose() << "Promoting " << index2algname( iAlgo ) << " to SCHEDULED on slot " << si << endmsg;
940  return updateSc;
941  } else {
942  if ( msgLevel( MSG::DEBUG ) )
943  debug() << "Could not acquire instance for algorithm " << index2algname( iAlgo ) << " on slot " << si << endmsg;
944  return sc;
945  }
946 }
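
The listing above binds promoteToExecuted to its arguments and defers the call until the algorithm has run. A minimal sketch of that bind-then-defer pattern follows. MiniPromoter, AlgState and drainActions are hypothetical stand-ins, not Gaudi classes, a plain std::queue replaces the TBB task machinery, and decrementing the in-flight counter inside promoteToExecuted is an assumption of this sketch.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <queue>
#include <vector>

// Hypothetical, simplified stand-ins; none of these types are Gaudi classes.
enum class AlgState { DataReady, Scheduled, Executed };

class MiniPromoter {
public:
  explicit MiniPromoter( std::size_t nAlgos ) : m_states( nAlgos, AlgState::DataReady ) {}

  // Mark the algorithm as SCHEDULED and queue a closure that will later
  // mark it as EXECUTED.
  void promoteToScheduled( unsigned int iAlgo ) {
    ++m_inFlight;
    m_states[iAlgo] = AlgState::Scheduled;
    // Bind the member function and its argument now; the call happens later.
    auto closure = std::bind( &MiniPromoter::promoteToExecuted, this, iAlgo );
    m_actions.push( closure );
  }

  // Run every queued closure in FIFO order, as a scheduler thread might.
  void drainActions() {
    while ( !m_actions.empty() ) {
      m_actions.front()();
      m_actions.pop();
    }
  }

  AlgState     state( unsigned int iAlgo ) const { return m_states[iAlgo]; }
  unsigned int inFlight() const { return m_inFlight; }

private:
  // Assumption of this sketch: the in-flight counter is decremented here.
  void promoteToExecuted( unsigned int iAlgo ) {
    m_states[iAlgo] = AlgState::Executed;
    --m_inFlight;
  }

  std::vector<AlgState>             m_states;
  std::queue<std::function<void()>> m_actions;
  unsigned int                      m_inFlight = 0;
};
```

The closure captures everything it needs when it is created, so the code that eventually runs it needs no knowledge of which algorithm or slot it refers to.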
StatusCode AvalancheSchedulerSvc::pushNewEvent ( EventContext *  eventContext)
override

Make an event available to the scheduler.

Add event to the scheduler.

There are two possible cases:

  • No slot is free: StatusCode::FAILURE is returned.
  • At least one slot is free: an action that resets the slot and kicks off its update is queued.

Definition at line 445 of file AvalancheSchedulerSvc.cpp.

446 {
447 
448  if ( m_first ) {
449  m_first = false;
450  }
451 
452  if ( !eventContext ) {
453  fatal() << "Event context is nullptr" << endmsg;
454  return StatusCode::FAILURE;
455  }
456 
457  if ( m_freeSlots.load() == 0 ) {
458  if ( msgLevel( MSG::DEBUG ) ) debug() << "A free processing slot could not be found." << endmsg;
459  return StatusCode::FAILURE;
460  }
461 
462  // no problem as push new event is only called from one thread (event loop manager)
463  m_freeSlots--;
464 
465  auto action = [this, eventContext]() -> StatusCode {
466  // Event processing slot forced to be the same as the wb slot
467  const unsigned int thisSlotNum = eventContext->slot();
468  EventSlot& thisSlot = m_eventSlots[thisSlotNum];
469  if ( !thisSlot.complete ) {
470  fatal() << "The slot " << thisSlotNum << " is supposed to be a finished event but it's not" << endmsg;
471  return StatusCode::FAILURE;
472  }
473 
474  debug() << "Executing event " << eventContext->evt() << " on slot " << thisSlotNum << endmsg;
475  thisSlot.reset( eventContext );
476 
477  // Result status code:
478  StatusCode result = StatusCode::SUCCESS;
479 
480  // promote to CR and DR the initial set of algorithms
481  Cause cs = {Cause::source::Root, "RootDecisionHub"};
482  if ( m_precSvc->iterate( thisSlot, cs ).isFailure() ) {
483  error() << "Failed to call IPrecedenceSvc::iterate for slot " << thisSlotNum << endmsg;
484  result = StatusCode::FAILURE;
485  }
486 
487  if ( this->updateStates( thisSlotNum ).isFailure() ) {
488  error() << "Failed to call AvalancheSchedulerSvc::updateStates for slot " << thisSlotNum << endmsg;
489  result = StatusCode::FAILURE;
490  }
491 
492  return result;
493  }; // end of lambda
494 
495  // Kick off the scheduling!
496  if ( msgLevel( MSG::VERBOSE ) ) {
497  verbose() << "Pushing the action to update the scheduler for slot " << eventContext->slot() << endmsg;
498  verbose() << "Free slots available " << m_freeSlots.load() << endmsg;
499  }
500  m_actionsQueue.push( action );
501 
502  return StatusCode::SUCCESS;
503 }
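
The two cases described above (no free slot, or a free slot plus a queued action) can be illustrated with a small stand-alone sketch. MiniSlotScheduler, MiniContext and Status are hypothetical names introduced here, a std::queue stands in for the TBB action queue, and the queued action is a trivial placeholder rather than the real slot reset.

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>
#include <functional>
#include <queue>

// Hypothetical stand-ins for StatusCode and EventContext.
enum class Status { Success, Failure };
struct MiniContext {
  unsigned int slot;
  unsigned int evt;
};

class MiniSlotScheduler {
public:
  explicit MiniSlotScheduler( int nSlots ) : m_freeSlots( nSlots ) {}

  Status pushNewEvent( MiniContext* ctx ) {
    if ( !ctx ) return Status::Failure;                     // null context is rejected
    if ( m_freeSlots.load() == 0 ) return Status::Failure;  // case 1: no free slot
    --m_freeSlots;                                          // case 2: claim a slot
    // Queue a placeholder action; the real one resets the slot and updates states.
    m_actions.push( [ctx]() { return ctx->slot; } );
    return Status::Success;
  }

  int         freeSlots() const { return m_freeSlots.load(); }
  std::size_t pendingActions() const { return m_actions.size(); }

private:
  std::atomic_int                           m_freeSlots;
  std::queue<std::function<unsigned int()>> m_actions;
};
```

As in the listing, the check and the decrement are two separate steps, which is only safe if a single thread calls pushNewEvent.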
StatusCode AvalancheSchedulerSvc::pushNewEvents ( std::vector< EventContext * > &  eventContexts)
override

Definition at line 506 of file AvalancheSchedulerSvc.cpp.

507 {
508  StatusCode sc;
509  for ( auto context : eventContexts ) {
510  sc = pushNewEvent( context );
511  if ( sc != StatusCode::SUCCESS ) return sc;
512  }
513  return sc;
514 }
StatusCode AvalancheSchedulerSvc::tryPopFinishedEvent ( EventContext *&  eventContext)
override

Try to fetch an event from the scheduler.

Try to get a finished event without blocking; if none is available, StatusCode::FAILURE is returned immediately.

Definition at line 562 of file AvalancheSchedulerSvc.cpp.

563 {
564  if ( m_finishedEvents.try_pop( eventContext ) ) {
565  if ( msgLevel( MSG::DEBUG ) )
566  debug() << "Try Pop successful slot " << eventContext->slot() << "(event " << eventContext->evt() << ")"
567  << endmsg;
568  m_freeSlots++;
569  return StatusCode::SUCCESS;
570  }
571  return StatusCode::FAILURE;
572 }
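
A caller would typically poll this method in a loop to collect finished events. The sketch below shows that non-blocking drain pattern under simplifying assumptions: FinishedQueue is a hypothetical mutex-protected std::queue standing in for the TBB queue, and FinishedEvent and drainFinished are illustrative names that do not exist in Gaudi.

```cpp
#include <cassert>
#include <mutex>
#include <queue>

// Hypothetical stand-in for EventContext.
struct FinishedEvent {
  unsigned int slot;
  unsigned int evt;
};

// Hypothetical mutex-protected queue with a non-blocking pop.
class FinishedQueue {
public:
  void push( FinishedEvent* e ) {
    std::lock_guard<std::mutex> lock( m_mutex );
    m_queue.push( e );
  }

  // Returns false immediately if the queue is empty; 'out' is then untouched.
  bool tryPop( FinishedEvent*& out ) {
    std::lock_guard<std::mutex> lock( m_mutex );
    if ( m_queue.empty() ) return false;
    out = m_queue.front();
    m_queue.pop();
    return true;
  }

private:
  std::mutex                 m_mutex;
  std::queue<FinishedEvent*> m_queue;
};

// Pop every finished event currently available and return the updated
// number of free slots (one slot is freed per popped event).
inline int drainFinished( FinishedQueue& q, int freeSlots ) {
  FinishedEvent* e = nullptr;
  while ( q.tryPop( e ) ) ++freeSlots;
  return freeSlots;
}
```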
StatusCode AvalancheSchedulerSvc::updateStates ( int  si = -1,
const std::string &  algo_name = std::string() 
)
private

Loop over the algorithms in the slots and promote them to successive states. A slot index of -1 means all slots; an empty algorithm name skips the update of the control flow state.

Update the state of the algorithms.

The oldest events are checked before the newest in order to reduce the event backlog. An event is considered finished when its control flow rules are resolved and:

  • No algorithms remain in the CONTROLREADY state (signed off by the control flow)
  • No algorithms remain in the DATAREADY state (signed off by the data flow)
  • No algorithms remain in the SCHEDULED state
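
The ordering and the completion test can be sketched in isolation. SlotSketch, isFinished and oldestFirst are hypothetical names; the per-state counters and the cfResolved flag are simplified stand-ins for what the real slot and precedence service provide.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical, flattened view of an event slot.
struct SlotSketch {
  unsigned int evt;           // event number
  bool         complete;      // slot already marked complete (free)
  bool         cfResolved;    // stand-in for "control flow rules resolved"
  int          nControlReady; // algorithms signed off by the control flow
  int          nDataReady;    // algorithms signed off by the data flow
  int          nScheduled;    // algorithms currently scheduled
};

// A slot has just finished if it is not yet flagged complete, its control
// flow is resolved, and nothing is waiting or running.
inline bool isFinished( const SlotSketch& s ) {
  return !s.complete && s.cfResolved && s.nControlReady == 0 && s.nDataReady == 0 && s.nScheduled == 0;
}

// Collect pointers to the incomplete slots, oldest event first.
inline std::vector<SlotSketch*> oldestFirst( std::vector<SlotSketch>& slots ) {
  std::vector<SlotSketch*> ptrs;
  ptrs.reserve( slots.size() );
  for ( auto& s : slots ) {
    if ( !s.complete ) ptrs.push_back( &s );
  }
  std::sort( ptrs.begin(), ptrs.end(),
             []( const SlotSketch* a, const SlotSketch* b ) { return a->evt < b->evt; } );
  return ptrs;
}
```

Sorting pointers rather than the slots themselves avoids copying slot objects and leaves the slot indices unchanged.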

Definition at line 625 of file AvalancheSchedulerSvc.cpp.

626 {
627 
628  m_updateNeeded = true;
629 
630  StatusCode global_sc( StatusCode::SUCCESS );
631 
632  // Sort from the oldest to the newest event
633  // Prepare a vector of pointers to the slots to avoid copies
634  std::vector<EventSlot*> eventSlotsPtrs;
635 
636  // Consider all slots if si <0 or just one otherwise
637  if ( si < 0 ) {
638  const int eventsSlotsSize( m_eventSlots.size() );
639  eventSlotsPtrs.reserve( eventsSlotsSize );
640  for ( auto slotIt = m_eventSlots.begin(); slotIt != m_eventSlots.end(); slotIt++ ) {
641  if ( !slotIt->complete ) eventSlotsPtrs.push_back( &( *slotIt ) );
642  }
643  std::sort( eventSlotsPtrs.begin(), eventSlotsPtrs.end(),
644  []( EventSlot* a, EventSlot* b ) { return a->eventContext->evt() < b->eventContext->evt(); } );
645  } else {
646  eventSlotsPtrs.push_back( &m_eventSlots[si] );
647  }
648 
649  for ( EventSlot* thisSlotPtr : eventSlotsPtrs ) {
650  int iSlot = thisSlotPtr->eventContext->slot();
651 
652  // Cache the states of the algos to improve readability and performance
653  auto& thisSlot = m_eventSlots[iSlot];
654  AlgsExecutionStates& thisAlgsStates = thisSlot.algsStates;
655 
656  // Perform the I->CR->DR transitions
657  if ( !algo_name.empty() ) {
658  Cause cs = {Cause::source::Task, algo_name};
659  if ( m_precSvc->iterate( thisSlot, cs ).isFailure() ) {
660  error() << "Failed to call IPrecedenceSvc::iterate for slot " << iSlot << endmsg;
661  global_sc = StatusCode::FAILURE;
662  }
663  }
664 
665  StatusCode partial_sc( StatusCode::FAILURE, true );
666 
667  // Perform DR->SCHEDULED
668  if ( !m_optimizationMode.empty() ) {
669  auto comp_nodes = [this]( const uint& i, const uint& j ) {
670  return ( m_precSvc->getPriority( index2algname( i ) ) < m_precSvc->getPriority( index2algname( j ) ) );
671  };
673  comp_nodes, std::vector<uint>() );
674  for ( auto it = thisAlgsStates.begin( AlgsExecutionStates::State::DATAREADY );
675  it != thisAlgsStates.end( AlgsExecutionStates::State::DATAREADY ); ++it )
676  buffer.push( *it );
677  /*std::stringstream s;
678  auto buffer2 = buffer;
679  while (!buffer2.empty()) {
680  s << m_precSvc->getPriority(index2algname(buffer2.top())) << ", ";
681  buffer2.pop();
682  }
683  info() << "DRBuffer is: [ " << s.str() << " ] <--" << algo_name << " executed" << endmsg;*/
684 
685  /*while (!buffer.empty()) {
686  partial_sc = promoteToScheduled(buffer.top(), iSlot);
687  if (partial_sc.isFailure()) {
688  if (msgLevel(MSG::VERBOSE))
689  verbose() << "Could not apply transition from "
690  << AlgsExecutionStates::stateNames[AlgsExecutionStates::State::DATAREADY]
691  << " for algorithm " << index2algname(buffer.top()) << " on processing slot " << iSlot << endmsg;
692  if (m_useIOBoundAlgScheduler) {
693  partial_sc = promoteToAsyncScheduled(buffer.top(), iSlot);
694  if (msgLevel(MSG::VERBOSE))
695  if (partial_sc.isFailure())
696  verbose() << "[Asynchronous] Could not apply transition from "
697  << AlgsExecutionStates::stateNames[AlgsExecutionStates::State::DATAREADY]
698  << " for algorithm " << index2algname(buffer.top()) << " on processing slot " << iSlot <<
699  endmsg;
700  }
701  }
702  buffer.pop();
703  }*/
704  while ( !buffer.empty() ) {
705  bool IOBound = false;
706  if ( m_useIOBoundAlgScheduler ) IOBound = m_precSvc->isBlocking( index2algname( buffer.top() ) );
707 
708  if ( !IOBound )
709  partial_sc = promoteToScheduled( buffer.top(), iSlot );
710  else
711  partial_sc = promoteToAsyncScheduled( buffer.top(), iSlot );
712 
713  if ( msgLevel( MSG::VERBOSE ) )
714  if ( partial_sc.isFailure() )
715  verbose() << "Could not apply transition from "
716  << AlgsExecutionStates::stateNames[AlgsExecutionStates::State::DATAREADY] << " for algorithm "
717  << index2algname( buffer.top() ) << " on processing slot " << iSlot << endmsg;
718 
719  buffer.pop();
720  }
721 
722  } else {
723  for ( auto it = thisAlgsStates.begin( AlgsExecutionStates::State::DATAREADY );
724  it != thisAlgsStates.end( AlgsExecutionStates::State::DATAREADY ); ++it ) {
725  uint algIndex = *it;
726 
727  bool IOBound = false;
728  if ( m_useIOBoundAlgScheduler ) IOBound = m_precSvc->isBlocking( index2algname( algIndex ) );
729 
730  if ( !IOBound )
731  partial_sc = promoteToScheduled( algIndex, iSlot );
732  else
733  partial_sc = promoteToAsyncScheduled( algIndex, iSlot );
734 
735  if ( msgLevel( MSG::VERBOSE ) )
736  if ( partial_sc.isFailure() )
737  verbose() << "Could not apply transition from "
738  << AlgsExecutionStates::stateNames[AlgsExecutionStates::State::DATAREADY] << " for algorithm "
739  << index2algname( algIndex ) << " on processing slot " << iSlot << endmsg;
740  }
741  }
742 
743  if ( m_dumpIntraEventDynamics ) {
744  std::stringstream s;
745  s << algo_name << ", " << thisAlgsStates.sizeOfSubset( State::CONTROLREADY ) << ", "
746  << thisAlgsStates.sizeOfSubset( State::DATAREADY ) << ", " << thisAlgsStates.sizeOfSubset( State::SCHEDULED )
747  << ", " << std::chrono::high_resolution_clock::now().time_since_epoch().count() << "\n";
749  : std::to_string( tbb::task_scheduler_init::default_num_threads() );
750  std::ofstream myfile;
751  myfile.open( "IntraEventConcurrencyDynamics_" + threads + "T.csv", std::ios::app );
752  myfile << s.str();
753  myfile.close();
754  }
755 
756  // Not complete because this would mean that the slot is already free!
757  if ( !thisSlot.complete && m_precSvc->CFRulesResolved( thisSlot ) &&
758  !thisSlot.algsStates.algsPresent( AlgsExecutionStates::CONTROLREADY ) &&
759  !thisSlot.algsStates.algsPresent( AlgsExecutionStates::DATAREADY ) &&
760  !thisSlot.algsStates.algsPresent( AlgsExecutionStates::SCHEDULED ) ) {
761 
762  thisSlot.complete = true;
763  // if the event did not fail, add it to the finished events
764  // otherwise it is taken care of in the error handling already
765  if ( m_algExecStateSvc->eventStatus( *thisSlot.eventContext ) == EventStatus::Success ) {
766  m_finishedEvents.push( thisSlot.eventContext );
767  if ( msgLevel( MSG::DEBUG ) )
768  debug() << "Event " << thisSlot.eventContext->evt() << " finished (slot " << thisSlot.eventContext->slot()
769  << ")." << endmsg;
770  }
771 
772  // now let's return the fully evaluated result of the control flow
773  if ( msgLevel( MSG::DEBUG ) ) debug() << m_precSvc->printState( thisSlot ) << endmsg;
774 
775  thisSlot.eventContext = nullptr;
776  } else {
777  StatusCode eventStalledSC = isStalled( iSlot );
778  if ( !eventStalledSC.isSuccess() ) {
779  m_algExecStateSvc->setEventStatus( EventStatus::AlgStall, *thisSlot.eventContext );
780  eventFailed( thisSlot.eventContext ).ignore();
781  }
782  }
783  } // end loop on slots
784 
785  verbose() << "States Updated." << endmsg;
786 
787  return global_sc;
788 }
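
When an optimization mode is set, the listing above pushes the DATAREADY algorithms into a priority buffer and promotes them from the top. A minimal sketch of that ordering follows. promotionOrder is a hypothetical helper, and the priority vector is an assumed stand-in for the values the precedence service would return; with a less-than comparator, std::priority_queue yields the largest priority value first.

```cpp
#include <cassert>
#include <queue>
#include <vector>

// Return the algorithm indices in the order they would be promoted:
// highest priority value first. 'priority[i]' is the assumed priority of
// algorithm i; every index in 'dataReady' must be valid for 'priority'.
inline std::vector<unsigned int> promotionOrder( const std::vector<unsigned int>& dataReady,
                                                 const std::vector<unsigned int>& priority ) {
  auto lowerPriority = [&priority]( unsigned int i, unsigned int j ) { return priority[i] < priority[j]; };

  std::priority_queue<unsigned int, std::vector<unsigned int>, decltype( lowerPriority )> buffer( lowerPriority );
  for ( unsigned int idx : dataReady ) buffer.push( idx );

  std::vector<unsigned int> order;
  while ( !buffer.empty() ) {
    order.push_back( buffer.top() );
    buffer.pop();
  }
  return order;
}
```

The relative order of algorithms with equal priority is unspecified, since std::priority_queue is not stable.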

Member Data Documentation

tbb::concurrent_bounded_queue<action> AvalancheSchedulerSvc::m_actionsQueue
private

Queue where closures are stored and picked for execution.

Definition at line 260 of file AvalancheSchedulerSvc.h.

SmartIF<IAlgExecStateSvc> AvalancheSchedulerSvc::m_algExecStateSvc
private

Algorithm execution state manager.

Definition at line 216 of file AvalancheSchedulerSvc.h.

std::unordered_map<std::string, unsigned int> AvalancheSchedulerSvc::m_algname_index_map
private

Map to bookkeep the information necessary to the name2index conversion.

Definition at line 186 of file AvalancheSchedulerSvc.h.

std::vector<std::string> AvalancheSchedulerSvc::m_algname_vect
private

Vector to bookkeep the information necessary to the index2name conversion.

Definition at line 192 of file AvalancheSchedulerSvc.h.
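
The map and the vector above together give a two-way lookup between algorithm names and integer indices. The sketch below shows one way such bookkeeping can work; AlgNameIndex and its methods are hypothetical, and how the scheduler actually fills these members is not shown in this reference.

```cpp
#include <cassert>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical two-way name/index registry.
class AlgNameIndex {
public:
  // Register a name and return its index; an already known name keeps its index.
  unsigned int add( const std::string& name ) {
    auto it = m_index.find( name );
    if ( it != m_index.end() ) return it->second;
    const unsigned int idx = static_cast<unsigned int>( m_names.size() );
    m_names.push_back( name );
    m_index.emplace( name, idx );
    return idx;
  }

  // Both lookups throw std::out_of_range for unknown keys.
  const std::string& index2name( unsigned int idx ) const { return m_names.at( idx ); }
  unsigned int       name2index( const std::string& name ) const { return m_index.at( name ); }

private:
  std::unordered_map<std::string, unsigned int> m_index; // name -> index
  std::vector<std::string>                      m_names; // index -> name
};
```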

unsigned int AvalancheSchedulerSvc::m_algosInFlight = 0
private

Number of algorithms presently in flight.

Definition at line 224 of file AvalancheSchedulerSvc.h.

SmartIF<IAlgResourcePool> AvalancheSchedulerSvc::m_algResourcePool
private

Cache for the algorithm resource pool.

Definition at line 252 of file AvalancheSchedulerSvc.h.

Gaudi::Property<bool> AvalancheSchedulerSvc::m_checkDeps {this, "CheckDependencies", false, "Runtime check of Algorithm Data Dependencies"}
private

Definition at line 152 of file AvalancheSchedulerSvc.h.

SmartIF<ICondSvc> AvalancheSchedulerSvc::m_condSvc
private

A shortcut to service for Conditions handling.

Definition at line 219 of file AvalancheSchedulerSvc.h.

Gaudi::Property<bool> AvalancheSchedulerSvc::m_dumpIntraEventDynamics
private
Initial value:
{this, "DumpIntraEventDynamics", false,
"Dump intra-event concurrency dynamics to csv file"}

Definition at line 147 of file AvalancheSchedulerSvc.h.

Gaudi::Property<bool> AvalancheSchedulerSvc::m_enableCondSvc {this, "EnableConditions", false, "Enable ConditionsSvc"}
private

Definition at line 157 of file AvalancheSchedulerSvc.h.

std::vector<EventSlot> AvalancheSchedulerSvc::m_eventSlots
private

Vector of events slots.

Definition at line 204 of file AvalancheSchedulerSvc.h.

tbb::concurrent_bounded_queue<EventContext*> AvalancheSchedulerSvc::m_finishedEvents
private

Queue of finished events.

Definition at line 210 of file AvalancheSchedulerSvc.h.

bool AvalancheSchedulerSvc::m_first = true
private

Definition at line 286 of file AvalancheSchedulerSvc.h.

std::atomic_int AvalancheSchedulerSvc::m_freeSlots
private

Atomic to account for asynchronous updates by the scheduler with respect to the rest.

Definition at line 207 of file AvalancheSchedulerSvc.h.

unsigned int AvalancheSchedulerSvc::m_IOBoundAlgosInFlight = 0
private

Number of I/O-bound algorithms presently in flight.

Definition at line 227 of file AvalancheSchedulerSvc.h.

SmartIF<IAccelerator> AvalancheSchedulerSvc::m_IOBoundAlgScheduler
private

A shortcut to IO-bound algorithm scheduler.

Definition at line 201 of file AvalancheSchedulerSvc.h.

Gaudi::Property<std::string> AvalancheSchedulerSvc::m_IOBoundAlgSchedulerSvcName {this, "IOBoundAlgSchedulerSvc", "IOBoundAlgSchedulerSvc"}
private

Definition at line 139 of file AvalancheSchedulerSvc.h.

std::atomic<ActivationState> AvalancheSchedulerSvc::m_isActive {INACTIVE}
private

Flag to track if the scheduler is active or not.

Definition at line 177 of file AvalancheSchedulerSvc.h.

size_t AvalancheSchedulerSvc::m_maxAlgosInFlight {1}
private

Definition at line 285 of file AvalancheSchedulerSvc.h.

size_t AvalancheSchedulerSvc::m_maxEventsInFlight {0}
private

Definition at line 284 of file AvalancheSchedulerSvc.h.

Gaudi::Property<unsigned int> AvalancheSchedulerSvc::m_maxIOBoundAlgosInFlight
private
Initial value:
{this, "MaxIOBoundAlgosInFlight", 0,
"Maximum number of simultaneous I/O-bound algorithms"}

Definition at line 140 of file AvalancheSchedulerSvc.h.

Gaudi::Property<std::string> AvalancheSchedulerSvc::m_optimizationMode
private
Initial value:
{this, "Optimizer", "",
"The following modes are currently available: PCE, COD, DRE, E"}

Definition at line 145 of file AvalancheSchedulerSvc.h.

SmartIF<IPrecedenceSvc> AvalancheSchedulerSvc::m_precSvc
private

A shortcut to the Precedence Service.

Definition at line 195 of file AvalancheSchedulerSvc.h.

Gaudi::Property<bool> AvalancheSchedulerSvc::m_showControlFlow
private
Initial value:
{this, "ShowControlFlow", false,
"Show the configuration of all Algorithms and Sequences"}

Definition at line 165 of file AvalancheSchedulerSvc.h.

Gaudi::Property<bool> AvalancheSchedulerSvc::m_showDataDeps
private
Initial value:
{this, "ShowDataDependencies", true,
"Show the INPUT and OUTPUT data dependencies of Algorithms"}

Definition at line 159 of file AvalancheSchedulerSvc.h.

Gaudi::Property<bool> AvalancheSchedulerSvc::m_showDataFlow
private
Initial value:
{this, "ShowDataFlow", false,
"Show the configuration of DataFlow between Algorithms"}

Definition at line 162 of file AvalancheSchedulerSvc.h.

Gaudi::Property<bool> AvalancheSchedulerSvc::m_simulateExecution
private
Initial value:
{
this, "SimulateExecution", false,
"Flag to perform single-pass simulation of execution flow before the actual execution"}

Definition at line 142 of file AvalancheSchedulerSvc.h.

std::mutex AvalancheSchedulerSvc::m_ssMut
static private

Definition at line 318 of file AvalancheSchedulerSvc.h.

std::list< AvalancheSchedulerSvc::SchedulerState > AvalancheSchedulerSvc::m_sState
static private

Definition at line 317 of file AvalancheSchedulerSvc.h.

std::thread AvalancheSchedulerSvc::m_thread
private

The thread in which the activate function runs.

Definition at line 180 of file AvalancheSchedulerSvc.h.

Gaudi::Property<int> AvalancheSchedulerSvc::m_threadPoolSize
private
Initial value:
{
this, "ThreadPoolSize", -1,
"Size of the threadpool initialised by TBB; a value of -1 gives TBB the freedom to choose"}

Definition at line 135 of file AvalancheSchedulerSvc.h.

SmartIF<IThreadPoolSvc> AvalancheSchedulerSvc::m_threadPoolSvc
private

Definition at line 283 of file AvalancheSchedulerSvc.h.

bool AvalancheSchedulerSvc::m_updateNeeded = true
private

Keep track of update actions scheduled.

Definition at line 248 of file AvalancheSchedulerSvc.h.

Gaudi::Property<std::string> AvalancheSchedulerSvc::m_useDataLoader
private
Initial value:
{this, "DataLoaderAlg", "",
"Attribute unmet input dependencies to this DataLoader Algorithm"}

Definition at line 154 of file AvalancheSchedulerSvc.h.

Gaudi::Property<bool> AvalancheSchedulerSvc::m_useIOBoundAlgScheduler
private
Initial value:
{this, "PreemptiveIOBoundTasks", false,
"Turn on preemptive way of scheduling of I/O-bound algorithms"}

Definition at line 149 of file AvalancheSchedulerSvc.h.

SmartIF<IHiveWhiteBoard> AvalancheSchedulerSvc::m_whiteboard
private

A shortcut to the whiteboard.

Definition at line 198 of file AvalancheSchedulerSvc.h.

Gaudi::Property<std::string> AvalancheSchedulerSvc::m_whiteboardSvcName {this, "WhiteboardSvc", "EventDataSvc", "The whiteboard name"}
private

Definition at line 138 of file AvalancheSchedulerSvc.h.


The documentation for this class was generated from the following files: