Reads people counter files with different data formats (see eventmappers package) and transforms it into a data stream. Reason to do that is that sometimes there are just the files and sometimes there can be real time streaming data. We unify the data format by transforming everything into a data stream to ensure homogeneous processing. Please beware when you add your own data format to not join all formats on the data stream just after creating a stream, because often different formats lack different properties (e.g. one lacks line information the next one lacks delay information and similar) what leads that different post-processing/enrichment steps are required to have all streams populated with equal properties for further processing in a unified way (see the linematching and enrichment projects how missing properties can be filled.).
#HOW To USE Start the job (BlockStorageIngestorApplication) with the different parameters:
- inputFolder = the folder where the Data Files are searched
- timeZone = the time zone where the transport provider operates - default: Europe/Berlin
- provider = the provider where the data is from to track the origin in case different providers share the same data. Default: "defaultProvider";
- kafkaBroker = Default: "localhost:9092";
- outputTopic = The output topic: "iu-fahrbar-prod-ingest-mgelectronics-flatfilecountdata-v1";
- maxWaitingTime = Waiting time for input file ordering in seconds: Default:"5";
In order to add another data format one can just use one of the event mapper examples and use it out of the box or use the event mappers as a template to create a data mapper for an own format.
Subsequently, one can use the Resource Processor as a template to create a configurable processor reading files form storage and using the suitable mapper.
--add-opens java.base/java.lang=ALL-UNNAMED --add-opens java.base/java.util=ALL-UNNAMED --add-opens java.base/java.io=ALL-UNNAMED
-Dexecution.runtime-mode=BATCH
- Deploys in the fahrbar-common namespace
- Mounts the buckets to
- /buckets/XXX READONLY
- /buckets/XXX READONLY
- /buckets/druidimport READWRITE
- Checkpoint/Savepoint Dir
- /checkpoints/flink/externalized-checkpoints
- /checkpoints/flink/savepoints
docker build --no-cache -t iu-flatfile-streamer:local -f iu-occupancy-flatfile-streamer/Dockerfile .