Ingest Bloomberg Market Data
This page explains how Bloomberg market data is ingested.
Pipelines are provided to ingest Bloomberg market data from .CSV or parquet file formats.
The pipelines that ingest .CSV format are:
-
bbgquoteingest
-
bbgtradeingest
The pipelines that ingest parquet format are:
-
bbgquotepq
-
bbgtradepq
Customize the pipeline
The file targeted by the pipeline for ingestion is configured by the FSI_FILEPATH
environment variable, which is provided when deploying the pipeline using the kxi pm deploy
.
Similarly, the region where the file is located is configured by the FSI_REGION
environment variable.
Pipelines can be customized to meet different requirements. By editing the corresponding spec.q
file for each pipeline, various aspects of the pipeline's operation can be adjusted. Each pipeline reads a file from S3 by default, but this can be changed to a different reader or location by modifying the .fsi.reader
variable. Refer to the available readers for options.
To edit the spec.q
files, first unpack the Accelerator. Then, use a text editor to edit them and make the necessary changes. Finally, push and deploy the updated package.
Unpack
Refer to the following documentation for general commands to unpack and re-package a package.
Edit
Now you can make the changes required to the spec.q
files.
Once changes have been made and packaged, the updated package must be pushed before being deployed.
Push and deploy
Refer to the following documentation on pushing a package.
Read the instructions to deploy a package.
Parquet ingest pipelines
The parquet format ingest pipelines copy the ingested data to the tmp
directory in the sp worker pod as part of the ingest process. This must be configured to be large enough to accommodate the size of the data being ingested, that is, all the data from the current file being ingested. The default size for the tmp
directory is 5MB. To modify this, the k8sPolicy
value should be edited in the pipeline YAML file.
YAML
worker:
image: {}
k8sPolicy:
resources:
tmpDirSize: 5Mi
The same commands referred to in the previous section can be used to unpack and re-package the package in order to modify pipeline yaml files.