You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[Pinot 1.2.0] Batch upload for realtime table using Spark fails with error "Creation time must be set for uploaded realtime segment name generator"
#14083
Open
ajeydudhe opened this issue
Sep 25, 2024
· 1 comment
Create schema for realtime table and define the table config having full upsert enabled.
Use the attached job spec for spark-submit command.
Note that there was issue with using http endpoint to fetch table config since it seems to expect the config to be returned only for OFFLINE table. Hence, using the local file path for realtime table. This is another issue.
Following is the segmentNameGeneratorSpec used.
The input file has format: uploaded__myTable__0__20220101T0000Z__suffix
Tried using the type as inputFile and uploadedRealtime
If type = uploadedRealtime then it fails with error "Creation time must be set for uploaded realtime segment name generator"
If type is inputFile and generated segment has same name format then segment gets loaded but on server it fails to load.
segmentNameGeneratorSpec:
# type: Current supported type is 'simple' and 'normalizedDate'.type: uploadedRealtime#type: inputFile# configs: Configs to init SegmentNameGenerator.configs:
#segment.name.prefix: 'uploaded__myTable__0__20220101T0000Z__suffix'#exclude.sequence.id: true# Below is for using file name as segment namefile.path.pattern: '.+/(.+)\.json'segment.name.template: '\${filePathPattern:\1}'
Please confirm on what should be the segmentNameGeneratorSpec.type used to generate segments from json files for realtime table using Spark.
Steps to reproduce
sparkIngestionJobSpec_myTable.yaml.txt
The text was updated successfully, but these errors were encountered: