Skip to content

An image based AWS Lambda, using ndr_parquet to generate parquet files from numerous filetypes.

License

Notifications You must be signed in to change notification settings

HealthDataInsight/ndr_parquet-lambda

Repository files navigation

ndr_parquet-lambda

This is an image-based AWS Lambda, using ndr_parquet to generate parquet files from numerous filetypes.

Installation

To install Apache Arrow on a mac using homebrew, run:

$ brew install apache-arrow

and then run:

$ bundle install

AWS Linux

To make the AWS Linux docker container with Apache Arrow, run:

$ make image

and then run:

$ make install

Usage

To convert the sample collection spreadsheet ABC_Collection-June-2020_03.xlsm to parquet format, using the national_collection.yml mapping file, run:

$ ruby import.rb

or to convert the spreadsheet in the docker container, run:

$ make import

Contributing

Bug reports and pull requests are welcome on GitHub at https://github.com/timgentry/ndr_import-parquet-example. This project is intended to be a safe, welcoming space for collaboration, and contributors are expected to adhere to the code of conduct.

License

The gem is available as open source under the terms of the MIT License.

Code of Conduct

Everyone interacting in the ndr_import-parquet-example project's codebases, issue trackers, chat rooms and mailing lists is expected to follow the code of conduct.

About

An image based AWS Lambda, using ndr_parquet to generate parquet files from numerous filetypes.

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •