Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feature: Mock image dataset for CSV loading (CSV pt. 1) #214

Merged
merged 7 commits into from
Nov 20, 2024

Conversation

ShrimpCryptid
Copy link
Contributor

@ShrimpCryptid ShrimpCryptid commented Nov 14, 2024

Problem

Part 1 of around 2-3 PRs I'll be opening to complete #213, "load CSV data".
This PR is very bare-bones because I'm trying to keep my PRs at a more manageable size. The next PR will include more robust unit testing and handling for BFF-specific data formats.

Estimated review size: large, 30 minutes

Follow up PRs:

Solution

  • Adds papaparse as a dependency.
  • Adds a new ImageDataset class, CsvRequest.
    • CSVRequest can parse a CSV string and extract string and numeric features from it.

Type of change

  • New feature (non-breaking change which adds functionality)

@ShrimpCryptid ShrimpCryptid self-assigned this Nov 14, 2024
@ShrimpCryptid ShrimpCryptid added the enhancement New feature or request label Nov 14, 2024
@@ -0,0 +1,39 @@
import { expect } from "chai";
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Unit tests are purposefully very basic right now! Will be adding more robust checks in a future PR.

@ShrimpCryptid ShrimpCryptid marked this pull request as ready for review November 14, 2024 21:59
@ShrimpCryptid ShrimpCryptid requested a review from a team as a code owner November 14, 2024 21:59
@ShrimpCryptid ShrimpCryptid requested review from interim17 and ascibisz and removed request for a team November 14, 2024 21:59
@ShrimpCryptid ShrimpCryptid changed the title feature: CSV Request image dataset feature: Mock image dataset for CSV loading (CSV pt. 1) Nov 19, 2024
* - Check for behavior when there is no discrete feature column -> validate groupby
* - Check for handling of BFF-specific column names (they should be remapped)
* - Check that metadata columns are parsed correctly
* - Check that metadata-related columns are not parsed as features

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Check when cellIds collide with row numbers

@ShrimpCryptid ShrimpCryptid removed the request for review from ascibisz November 19, 2024 18:09
Copy link

@interim17 interim17 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Don't feel super familiar/confident about the data that's coming in here, but LGTM! Nice work.

Comment on lines +6 to +9
"\npotato,https://example.com/1/raw.ome.zarr,https://example.com/1.png,1,2,3,A" +
"\ngarbanzo,https://example.com/2/raw.ome.zarr,https://example.com/2.jpeg,7,3.4,1,B" +
"\nturnip,https://example.com/3/raw.ome.zarr,https://example.com/3.jpeg,4,5,6,B" +
"\nrutabaga,https://example.com/4/raw.ome.zarr,https://example.com/4.jpeg,9,2.8,NaN,C";

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Delicious fall themed tuber data

@ShrimpCryptid ShrimpCryptid merged commit adff277 into main Nov 20, 2024
1 check passed
@ShrimpCryptid ShrimpCryptid deleted the feature/csv-loader-1 branch November 20, 2024 02:08
@ShrimpCryptid ShrimpCryptid linked an issue Nov 25, 2024 that may be closed by this pull request
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

prototype direct spreadsheet import
3 participants