Class BaseForExecOpBindJoinWithRequestOps<QueryType extends Query,MemberType extends FederationMember>
java.lang.Object
se.liu.ida.hefquin.engine.queryplan.executable.impl.ops.BaseForExecOps
se.liu.ida.hefquin.engine.queryplan.executable.impl.ops.UnaryExecutableOpBase
se.liu.ida.hefquin.engine.queryplan.executable.impl.ops.BaseForExecOpBindJoin<QueryType,MemberType>
se.liu.ida.hefquin.engine.queryplan.executable.impl.ops.BaseForExecOpBindJoinWithRequestOps<QueryType,MemberType>
- All Implemented Interfaces:
StatsProvider
,ExecutableOperator
,UnaryExecutableOp
- Direct Known Subclasses:
BaseForExecOpBindJoinSPARQL
,ExecOpBindJoinBRTPF
public abstract class BaseForExecOpBindJoinWithRequestOps<QueryType extends Query,MemberType extends FederationMember>
extends BaseForExecOpBindJoin<QueryType,MemberType>
A generic implementation of the bind join algorithm that uses executable
request operators for performing the requests to the federation member.
The implementation is generic in the sense that it works with any type of
request operator. Each concrete implementation that extends this base class
needs to implement the
createExecutableRequestOperator(Iterable)
function to create the request operators with the types of requests that
are specific to that concrete implementation.
This implementation is capable of separating out each input solution mapping
that assigns a blank node to any of the join variables. Then, such solution
mappings are not even considered when creating the requests because they
cannot have any join partners in the results obtained from the federation
member. Of course, in case the algorithm is used under outer-join semantics
these solution mappings are still returned to the output (without joining
them with anything).
Another capability of this implementation is that, instead of simply using
every input block of solution mappings to directly create a corresponding
bind-join request, this implementation can split the input block into
smaller blocks for the requests. On top of that, in case a request operator
fails, this implementation automatically reduces the block size for requests
and, then, tries to re-process (with the reduced request block size) the
input solution mappings for which the request operator failed.
A potential downside of the latter capability is that, if this algorithm
has to execute multiple requests per input block, then these requests are
executed sequentially.-
Nested Class Summary
Nested ClassesModifier and TypeClassDescriptionprotected class
protected class
-
Field Summary
FieldsModifier and TypeFieldDescriptionprotected static final int
The minimum value to whichrequestBlockSize
can be reduced.protected int
protected int
The number of solution mappings that this operator uses for each of the bind join requests.protected boolean
protected ExecutableOperatorStats
protected ExecutableOperatorStats
protected final boolean
protected final Set<org.apache.jena.sparql.core.Var>
Fields inherited from class se.liu.ida.hefquin.engine.queryplan.executable.impl.ops.BaseForExecOpBindJoin
defaultPreferredInputBlockSize, fm, query
Fields inherited from class se.liu.ida.hefquin.engine.queryplan.executable.impl.ops.UnaryExecutableOpBase
timeAtCurrentProcStart
Fields inherited from class se.liu.ida.hefquin.engine.queryplan.executable.impl.ops.BaseForExecOps
collectExceptions
-
Constructor Summary
ConstructorsConstructorDescriptionBaseForExecOpBindJoinWithRequestOps
(QueryType query, MemberType fm, boolean useOuterJoinSemantics, boolean collectExceptions) BaseForExecOpBindJoinWithRequestOps
(QueryType query, MemberType fm, boolean useOuterJoinSemantics, Set<org.apache.jena.sparql.core.Var> varsInPatternForFM, boolean collectExceptions) -
Method Summary
Modifier and TypeMethodDescriptionprotected void
_process
(Iterable<SolutionMapping> joinableInputSMs, IntermediateResultElementSink sink, ExecutionContext execCxt) protected void
_process
(IntermediateResultBlock input, IntermediateResultElementSink sink, ExecutionContext execCxt) Implementations of this function need to process the given input block and send the produced result elements (if any) to the given sink.protected void
_processWithoutSplittingInputFirst
(List<SolutionMapping> joinableInputSMs, IntermediateResultElementSink sink, ExecutionContext execCxt) protected abstract NullaryExecutableOp
The returned operator should be created such that it throws exceptions instead of collecting them.protected ExecutableOperatorStatsImpl
protected void
extractUnjoinableInputSMs
(Iterable<SolutionMapping> solMaps, Iterable<org.apache.jena.sparql.core.Var> potentialJoinVars, List<SolutionMapping> unjoinable, List<SolutionMapping> joinable) Splits the given collection of solution mappings into two such that the first list contains all the solution mappings that are guaranteed not to have any join partner and the second list contains the rest of the given input solution mappings (which may have join partners).protected boolean
ReducesrequestBlockSize
to its current value divided by 2 if the resulting value would still be greater or equal tominimumRequestBlockSize
.void
Methods inherited from class se.liu.ida.hefquin.engine.queryplan.executable.impl.ops.BaseForExecOpBindJoin
_concludeExecution, preferredInputBlockSize
Methods inherited from class se.liu.ida.hefquin.engine.queryplan.executable.impl.ops.UnaryExecutableOpBase
concludeExecution, getStats, process
Methods inherited from class se.liu.ida.hefquin.engine.queryplan.executable.impl.ops.BaseForExecOps
getExceptionsCaughtDuringExecution, recordExceptionCaughtDuringExecution
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface se.liu.ida.hefquin.engine.queryplan.executable.ExecutableOperator
getExceptionsCaughtDuringExecution
-
Field Details
-
useOuterJoinSemantics
protected final boolean useOuterJoinSemantics -
varsInPatternForFM
-
requestBlockSize
protected int requestBlockSizeThe number of solution mappings that this operator uses for each of the bind join requests. This number may be adapted at runtime. -
minimumRequestBlockSize
protected static final int minimumRequestBlockSizeThe minimum value to whichrequestBlockSize
can be reduced.- See Also:
-
requestBlockSizeWasReduced
protected boolean requestBlockSizeWasReduced -
numberOfRequestOpsUsed
protected int numberOfRequestOpsUsed -
statsOfFirstReqOp
-
statsOfLastReqOp
-
-
Constructor Details
-
BaseForExecOpBindJoinWithRequestOps
public BaseForExecOpBindJoinWithRequestOps(QueryType query, MemberType fm, boolean useOuterJoinSemantics, Set<org.apache.jena.sparql.core.Var> varsInPatternForFM, boolean collectExceptions) - Parameters:
varsInPatternForFM
- may be used by sub-classes to provide the set of variables that occur in the graph pattern that the bind join evaluates at the federation member; sub-classes that cannot extract this set in their constructor may passnull
as value for this argument; if provided, this implementation can filter out input solution mappings that contain blank nodes for the join variables and, thus, cannot be joined with the solution mappings obtained via the requests
-
BaseForExecOpBindJoinWithRequestOps
public BaseForExecOpBindJoinWithRequestOps(QueryType query, MemberType fm, boolean useOuterJoinSemantics, boolean collectExceptions)
-
-
Method Details
-
_process
protected void _process(IntermediateResultBlock input, IntermediateResultElementSink sink, ExecutionContext execCxt) throws ExecOpExecutionException Description copied from class:UnaryExecutableOpBase
Implementations of this function need to process the given input block and send the produced result elements (if any) to the given sink. If an exception occurs while processing the input block, this exception needs to either be collected or be thrown, depending on whetherBaseForExecOps.collectExceptions
is set totrue
.- Specified by:
_process
in classUnaryExecutableOpBase
- Throws:
ExecOpExecutionException
-
_process
protected void _process(Iterable<SolutionMapping> joinableInputSMs, IntermediateResultElementSink sink, ExecutionContext execCxt) throws ExecOpExecutionException - Throws:
ExecOpExecutionException
-
_processWithoutSplittingInputFirst
protected void _processWithoutSplittingInputFirst(List<SolutionMapping> joinableInputSMs, IntermediateResultElementSink sink, ExecutionContext execCxt) throws ExecOpExecutionException - Throws:
ExecOpExecutionException
-
reduceRequestBlockSize
protected boolean reduceRequestBlockSize()ReducesrequestBlockSize
to its current value divided by 2 if the resulting value would still be greater or equal tominimumRequestBlockSize
. In this case, this function returnstrue
. Otherwise, the function returnsfalse
without reducingrequestBlockSize
. -
createExecutableRequestOperator
protected abstract NullaryExecutableOp createExecutableRequestOperator(Iterable<SolutionMapping> solMaps) The returned operator should be created such that it throws exceptions instead of collecting them. -
extractUnjoinableInputSMs
protected void extractUnjoinableInputSMs(Iterable<SolutionMapping> solMaps, Iterable<org.apache.jena.sparql.core.Var> potentialJoinVars, List<SolutionMapping> unjoinable, List<SolutionMapping> joinable) Splits the given collection of solution mappings into two such that the first list contains all the solution mappings that are guaranteed not to have any join partner and the second list contains the rest of the given input solution mappings (which may have join partners). Typically, the solution mappings that are guaranteed not to have join partners are the ones that have a blank node for one of the join variables. -
resetStats
public void resetStats()- Specified by:
resetStats
in interfaceStatsProvider
- Overrides:
resetStats
in classUnaryExecutableOpBase
-
createStats
- Overrides:
createStats
in classBaseForExecOpBindJoin<QueryType extends Query,
MemberType extends FederationMember>
-