No more non-unique column names, or non-queryable columns in list_columns #70

ivirshup · 2024-04-08T14:33:57Z

Fixes #42: list_columns includes columns from "metadata" and "chromosome" table


gamazeps · 2024-04-08T18:04:46Z

src/genomic_features/ensembl/ensembldb.py

        elif isinstance(tables, str):
            tables = [tables]  # list of tables names (only one)
-        columns = [c for t in tables for c in self.db.table(t).columns]
+


    columns = [c for t in tables for c in self.db.table(t).columns]
    return list(sorted(set(columns)))

is shorter and more readable, no?

It even works with the current tests.

ivirshup · 2024-04-09

I think things are grouped a little more logically when they're ordered by table:

Example

In [2]: ensdb = gf.ensembl.annotation("Hsapiens", 108)

In [3]: ensdb.list_columns()
Out[3]:
['gene_id', 'gene_name', 'gene_biotype', 'gene_seq_start', 'gene_seq_end',
 'seq_name', 'seq_strand', 'seq_coord_system', 'description',
 'gene_id_version', 'canonical_transcript',
 'tx_id', 'tx_biotype', 'tx_seq_start', 'tx_seq_end', 'tx_cds_seq_start',
 'tx_cds_seq_end', 'tx_support_level', 'tx_id_version', 'gc_content',
 'tx_external_name', 'tx_is_canonical',
 'exon_id', 'exon_idx', 'exon_seq_start', 'exon_seq_end',
 'seq_length', 'is_circular',
 'protein_id', 'protein_sequence',
 'uniprot_id', 'uniprot_db', 'uniprot_mapping_type',
 'protein_domain_id', 'protein_domain_source', 'interpro_accession',
 'prot_dom_start', 'prot_dom_end',
 'entrezid']

In [4]: sorted(ensdb.list_columns())
Out[4]:
['canonical_transcript', 'description', 'entrezid', 'exon_id', 'exon_idx',
 'exon_seq_end', 'exon_seq_start', 'gc_content', 'gene_biotype', 'gene_id',
 'gene_id_version', 'gene_name', 'gene_seq_end', 'gene_seq_start',
 'interpro_accession', 'is_circular', 'prot_dom_end', 'prot_dom_start',
 'protein_domain_id', 'protein_domain_source', 'protein_id',
 'protein_sequence', 'seq_coord_system', 'seq_length', 'seq_name',
 'seq_strand', 'tx_biotype', 'tx_cds_seq_end', 'tx_cds_seq_start',
 'tx_external_name', 'tx_id', 'tx_id_version', 'tx_is_canonical',
 'tx_seq_end', 'tx_seq_start', 'tx_support_level', 'uniprot_db',
 'uniprot_id', 'uniprot_mapping_type']
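
To make the trade-off in this thread concrete, here is a minimal sketch of the two ways to deduplicate column names. It is not the code merged in this PR: `TABLES` is a made-up stand-in for the database schema, and `unique_sorted` / `unique_by_table` are hypothetical helper names. `dict.fromkeys` is used as one common way to drop duplicates while keeping first-seen order.

```python
# Hypothetical schema: table name -> column names (not the real EnsDb layout).
TABLES = {
    "gene": ["gene_id", "gene_name", "seq_name"],
    "tx": ["tx_id", "gene_id", "tx_biotype"],  # gene_id repeats across tables
    "exon": ["exon_id", "exon_idx"],
}


def unique_sorted(tables):
    """Deduplicate via a set, then sort alphabetically (the reviewer's suggestion)."""
    columns = [c for t in tables for c in TABLES[t]]
    return sorted(set(columns))


def unique_by_table(tables):
    """Deduplicate while keeping the order in which tables list their columns."""
    columns = [c for t in tables for c in TABLES[t]]
    # dicts preserve insertion order (Python 3.7+), so the first occurrence wins.
    return list(dict.fromkeys(columns))


if __name__ == "__main__":
    print(unique_sorted(TABLES))
    print(unique_by_table(TABLES))
```

Both variants return each name once; only the second keeps columns from the same table next to each other, which is the grouping argued for above.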

codecov-commenter · 2024-04-10T08:57:20Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 93.06%. Comparing base (bae82d2) to head (66c32d8).
Report is 5 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #70      +/-   ##
==========================================
+ Coverage   92.94%   93.06%   +0.12%     
==========================================
  Files           6        6              
  Lines         340      346       +6     
==========================================
+ Hits          316      322       +6     
  Misses         24       24

Files with missing lines	Coverage Δ
src/genomic_features/ensembl/ensembldb.py	`93.10% <100.00%> (+0.21%)`	⬆️

No more non-unique column names, or non-queryable columns in list_col…

5d8b6f7

…umns

ivirshup added this to the 0.1.0 milestone Apr 8, 2024

gamazeps reviewed Apr 8, 2024


ivirshup added 2 commits April 9, 2024 08:04

Fix order of columns returned by default

2101f0f

Merge branch 'main' into list-columns-unique

66c32d8

ivirshup enabled auto-merge (squash) April 10, 2024 08:54

ivirshup merged commit 66148be into scverse:main Apr 10, 2024
5 checks passed
