This is an image-based AWS Lambda, using ndr_parquet to generate parquet files from numerous filetypes.
To install Apache Arrow on a mac using homebrew, run:
$ brew install apache-arrow
and then run:
$ bundle install
To make the AWS Linux docker container with Apache Arrow, run:
$ make image
and then run:
$ make install
To convert the sample collection spreadsheet ABC_Collection-June-2020_03.xlsm
to parquet format, using the national_collection.yml
mapping file, run:
$ ruby import.rb
or to convert the spreadsheet in the docker container, run:
$ make import
Bug reports and pull requests are welcome on GitHub at https://github.com/timgentry/ndr_import-parquet-example. This project is intended to be a safe, welcoming space for collaboration, and contributors are expected to adhere to the code of conduct.
The gem is available as open source under the terms of the MIT License.
Everyone interacting in the ndr_import-parquet-example project's codebases, issue trackers, chat rooms and mailing lists is expected to follow the code of conduct.