rzvde-g8-kirsanov-dmitry - Details for Query 37

Submitted Time: 2026/04/08 09:00:30
Duration: 0.3 s
Succeeded Jobs: 62, 63


Plan graph metrics (data flow: Scan csv -> Filter -> Project -> HashAggregate -> Exchange -> HashAggregate -> AdaptiveSparkPlan)

AdaptiveSparkPlan

WholeStageCodegen (2)
duration: 0 ms

HashAggregate [WholeStageCodegen (2)]
time in aggregation build: 0 ms
number of output rows: 1

Exchange
shuffle records written: 2
local merged chunks fetched: 0
shuffle write time total (min, med, max (stageId: taskId)): 0 ms (0 ms, 0 ms, 0 ms (stage 108.0: task 94))
remote merged bytes read: 0.0 B
local merged blocks fetched: 0
corrupt merged block chunks: 0
remote merged reqs duration: 0 ms
remote merged blocks fetched: 0
records read: 2
local bytes read: 118.0 B
fetch wait time: 0 ms
remote bytes read: 0.0 B
merged fetch fallback count: 0
local blocks read: 2
remote merged chunks fetched: 0
remote blocks read: 0
data size total (min, med, max (stageId: taskId)): 32.0 B (0.0 B, 16.0 B, 16.0 B (stage 108.0: task 94))
local merged bytes read: 0.0 B
number of partitions: 1
remote reqs duration: 0 ms
remote bytes read to disk: 0.0 B
shuffle bytes written total (min, med, max (stageId: taskId)): 118.0 B (0.0 B, 59.0 B, 59.0 B (stage 108.0: task 94))

WholeStageCodegen (1)
duration total (min, med, max (stageId: taskId)): 285 ms (99 ms, 186 ms, 186 ms (stage 108.0: task 93))

HashAggregate [WholeStageCodegen (1)]
time in aggregation build total (min, med, max (stageId: taskId)): 284 ms (99 ms, 185 ms, 185 ms (stage 108.0: task 93))
number of output rows: 2

Project [WholeStageCodegen (1)]

Filter [WholeStageCodegen (1)]
number of output rows: 483

Scan csv
number of output rows: 31,257
number of files read: 1
metadata time: 0 ms
size of files read: 5.8 MiB
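The "total (min, med, max)" summaries in the plan metrics can be sanity-checked with a little arithmetic. The sketch below is illustrative only: the function name `summarize` is hypothetical, the two per-task values (99 ms and 186 ms) are inferred from the reported total, min and max rather than shown directly, and the median rule (element at index n // 2 of the sorted values) is an assumption that happens to reproduce the figures above.

```python
from typing import List, Tuple


def summarize(values: List[float]) -> Tuple[float, float, float, float]:
    """Return (total, min, med, max) for a list of per-task metric values."""
    if not values:
        raise ValueError("need at least one per-task value")
    ordered = sorted(values)
    # Assumption: the median is the element at index n // 2 of the sorted
    # values, so with two tasks it equals the larger value.
    return (sum(ordered), ordered[0], ordered[len(ordered) // 2], ordered[-1])


# Inferred per-task durations for WholeStageCodegen (1): 99 ms + 186 ms = 285 ms.
print(summarize([186, 99]))  # (285, 99, 186, 186)

# Inferred per-task "time in aggregation build": 99 ms + 185 ms = 284 ms.
print(summarize([185, 99]))  # (284, 99, 185, 185)

# Filter selectivity from the reported row counts: 483 of 31,257 scanned rows.
scanned, kept = 31_257, 483
selectivity = kept / scanned
print(f"{selectivity:.2%}")
```

Read this way, nearly all of the stage time is spent in the first codegen stage (scan, filter, partial aggregate), and the filter passes roughly 1.5% of the scanned rows.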

AdaptiveSparkPlan isFinalPlan=true

HashAggregate(keys=[], functions=[count(1)])

WholeStageCodegen (2)

Exchange SinglePartition, ENSURE_REQUIREMENTS, [plan_id=1379]

HashAggregate(keys=[], functions=[partial_count(1)])

Project

Filter ((isnull(ride_id#9482) OR (ended_at#9485 <= started_at#9484)) OR NOT (((isnotnull(end_station_id#9489) AND isnotnull(start_station_id#9487)) AND NOT (end_station_id#9489 = start_station_id#9487)) <=> true))

WholeStageCodegen (1)

FileScan csv [ride_id#9482,started_at#9484,ended_at#9485,start_station_id#9487,end_station_id#9489] Batched: false, DataFilters: [((isnull(ride_id#9482) OR (ended_at#9485 <= started_at#9484)) OR NOT (((isnotnull(end_station_id..., Format: CSV, Location: InMemoryFileIndex(1 paths)[s3a://rzvde-g8-kirsanov-dmitry/raw/citibike_data/202502/202502-citibik..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<ride_id:string,started_at:timestamp,ended_at:timestamp,start_station_id:string,end_station...

Details

== Physical Plan ==
AdaptiveSparkPlan (13)
+- == Final Plan ==
   * HashAggregate (7)
   +- ShuffleQueryStage (6), Statistics(sizeInBytes=32.0 B, rowCount=2)
      +- Exchange (5)
         +- * HashAggregate (4)
            +- * Project (3)
               +- * Filter (2)
                  +- Scan csv  (1)
+- == Initial Plan ==
   HashAggregate (12)
   +- Exchange (11)
      +- HashAggregate (10)
         +- Project (9)
            +- Filter (8)
               +- Scan csv  (1)


(1) Scan csv 
Output [5]: [ride_id#9482, started_at#9484, ended_at#9485, start_station_id#9487, end_station_id#9489]
Batched: false
Location: InMemoryFileIndex [s3a://rzvde-g8-kirsanov-dmitry/raw/citibike_data/202502/202502-citibike-tripdata-part00.csv]
ReadSchema: struct<ride_id:string,started_at:timestamp,ended_at:timestamp,start_station_id:string,end_station_id:string>

(2) Filter [codegen id : 1]
Input [5]: [ride_id#9482, started_at#9484, ended_at#9485, start_station_id#9487, end_station_id#9489]
Condition : ((isnull(ride_id#9482) OR (ended_at#9485 <= started_at#9484)) OR NOT (((isnotnull(end_station_id#9489) AND isnotnull(start_station_id#9487)) AND NOT (end_station_id#9489 = start_station_id#9487)) <=> true))
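The Filter condition mixes ordinary comparisons, which can evaluate to NULL, with the null-safe `<=>` operator, which cannot. The following is a simplified model of that predicate under SQL three-valued logic, not Spark's generated code. All helper names (`and3`, `or3`, `not3`, `eq3`, `le3`, `null_safe_eq`, `filter_keeps`) are hypothetical, and `None` stands in for NULL.

```python
from datetime import datetime
from typing import Optional

Tri = Optional[bool]  # None models SQL NULL


def not3(a: Tri) -> Tri:
    return None if a is None else (not a)


def and3(a: Tri, b: Tri) -> Tri:
    if a is False or b is False:
        return False  # FALSE dominates, even against NULL
    if a is None or b is None:
        return None
    return True


def or3(a: Tri, b: Tri) -> Tri:
    if a is True or b is True:
        return True  # TRUE dominates, even against NULL
    if a is None or b is None:
        return None
    return False


def eq3(a, b) -> Tri:
    return None if a is None or b is None else a == b


def le3(a, b) -> Tri:
    return None if a is None or b is None else a <= b


def null_safe_eq(a, b) -> bool:
    # Models <=> : NULL <=> NULL is TRUE, NULL <=> value is FALSE, never NULL.
    if a is None or b is None:
        return a is None and b is None
    return a == b


def filter_keeps(ride_id, started_at, ended_at, start_station_id, end_station_id) -> bool:
    """Model of the plan's Filter: True when the row passes (is counted)."""
    stations_ok = and3(
        and3(end_station_id is not None, start_station_id is not None),
        not3(eq3(end_station_id, start_station_id)),
    )
    cond = or3(
        or3(ride_id is None, le3(ended_at, started_at)),
        not3(null_safe_eq(stations_ok, True)),
    )
    return cond is True  # a Filter keeps a row only when the condition is TRUE


t0, t1 = datetime(2025, 2, 1, 8, 0), datetime(2025, 2, 1, 8, 30)
print(filter_keeps("r1", t0, t1, "A", "B"))   # False: well-formed ride
print(filter_keeps("r2", t0, t1, "A", "A"))   # True: same start and end station
print(filter_keeps("r3", t1, t0, "A", "B"))   # True: ends before it starts
print(filter_keeps("r4", t0, t1, None, "B"))  # True: missing start station
print(filter_keeps(None, t0, t1, "A", "B"))   # True: null ride_id
```

In this model a row with a NULL `ended_at` but otherwise valid fields makes the whole condition NULL, so it does not pass the filter and is not counted.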

(3) Project [codegen id : 1]
Output: []
Input [5]: [ride_id#9482, started_at#9484, ended_at#9485, start_station_id#9487, end_station_id#9489]

(4) HashAggregate [codegen id : 1]
Input: []
Keys: []
Functions [1]: [partial_count(1)]
Aggregate Attributes [1]: [count#10049L]
Results [1]: [count#10050L]

(5) Exchange
Input [1]: [count#10050L]
Arguments: SinglePartition, ENSURE_REQUIREMENTS, [plan_id=1379]

(6) ShuffleQueryStage
Output [1]: [count#10050L]
Arguments: 0

(7) HashAggregate [codegen id : 2]
Input [1]: [count#10050L]
Keys: []
Functions [1]: [count(1)]
Aggregate Attributes [1]: [count(1)#10046L]
Results [1]: [count(1)#10046L AS count#10047L]
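Operators (4) through (7) form a two-phase count: each task computes a partial count over its own partition, the Exchange moves those partial rows into a single partition, and the final HashAggregate merges them. A minimal sketch of that pattern follows; the function names and the sample partition contents are hypothetical, since the plan does not show how the filtered rows were split across tasks.

```python
from typing import Iterable, List


def partial_count(rows: Iterable[object]) -> int:
    """Per-task step, analogous to partial_count(1): count one partition's rows."""
    return sum(1 for _ in rows)


def final_count(partials: Iterable[int]) -> int:
    """Final step after the SinglePartition exchange: merge the partial counts."""
    return sum(partials)


# Hypothetical split of filtered rows across two tasks.
partitions: List[List[str]] = [["a", "b", "c"], ["d", "e"]]

# One partial row per task goes through the shuffle.
partials = [partial_count(p) for p in partitions]
print(partials)               # [3, 2]
print(final_count(partials))  # 5
```

This matches the shape of the metrics in this query: the partial HashAggregate emits 2 rows (one per task), the Exchange writes and reads 2 records, and the final HashAggregate emits 1 row.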

(8) Filter
Input [5]: [ride_id#9482, started_at#9484, ended_at#9485, start_station_id#9487, end_station_id#9489]
Condition : ((isnull(ride_id#9482) OR (ended_at#9485 <= started_at#9484)) OR NOT (((isnotnull(end_station_id#9489) AND isnotnull(start_station_id#9487)) AND NOT (end_station_id#9489 = start_station_id#9487)) <=> true))

(9) Project
Output: []
Input [5]: [ride_id#9482, started_at#9484, ended_at#9485, start_station_id#9487, end_station_id#9489]

(10) HashAggregate
Input: []
Keys: []
Functions [1]: [partial_count(1)]
Aggregate Attributes [1]: [count#10049L]
Results [1]: [count#10050L]

(11) Exchange
Input [1]: [count#10050L]
Arguments: SinglePartition, ENSURE_REQUIREMENTS, [plan_id=1365]

(12) HashAggregate
Input [1]: [count#10050L]
Keys: []
Functions [1]: [count(1)]
Aggregate Attributes [1]: [count(1)#10046L]
Results [1]: [count(1)#10046L AS count#10047L]

(13) AdaptiveSparkPlan
Output [1]: [count#10047L]
Arguments: isFinalPlan=true