Commit
update search terms and ownership entries
jlopp committed Nov 27, 2024
1 parent 00fca5c commit b973316
Showing 2 changed files with 57 additions and 18 deletions.
8 changes: 4 additions & 4 deletions README.md
@@ -319,7 +319,7 @@ NOTE: If you open a link to a Senator's disclosure, you need to paste the URL in
| Lucas, Frank | R | OK | House | NO | [2020](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2020/10041075.pdf) |
| Luetkemeyer, Blaine | R | MO | House | NO | [2020](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2020/8218118.pdf) |
| Luján, Ben | D | NM | Senate | NO | [2020](https://efdsearch.senate.gov/search/print/paper/71127e2f-b5ea-471a-841e-e5ec1102d452/) |
| Lummis, Cynthia | R | WY | Senate | YES | [2020](https://efdsearch.senate.gov/search/view/annual/5a9f95fe-06e6-4abf-866f-d1d174e510e9/) | $100K - $250K of bitcoin owned |
| Lummis, Cynthia | R | WY | Senate | YES | [2020](https://efdsearch.senate.gov/search/view/annual/5a9f95fe-06e6-4abf-866f-d1d174e510e9/) | $100K - $250K BTC, moved to blind trust in 2021 |
| Luttrell, Morgan | R | TX | House | - | - |
| Lynch, Stephen | D | MA | House | NO | [2020](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2020/10041076.pdf) |
| Mace, Nancy | R | SC | House | NO | [2020](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2020/10043329.pdf) |
@@ -335,7 +335,7 @@ NOTE: If you open a link to a Senator's disclosure, you need to paste the URL in
| Matsui, Doris | D | CA | House | NO | [2020](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2020/8218199.pdf) |
| McBath, Lucy | D | GA | House | NO | [2020](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2020/10042414.pdf) |
| McCarthy, Kevin | R | CA | House | NO | [2020](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2020/10041107.pdf) |
| McCaul, Michael | R | TX | House | YES | [2021](https://disclosures-clerk.house.gov/public_disc/ptr-pdfs/2021/8217869.pdf) | $1K-$15K GBTC in children's trust |
| McCaul, Michael | R | TX | House | NO | [2023](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2023/8220569.pdf) |
| McClain, Lisa | R | MI | House | NO | [2020](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2020/10043335.pdf) |
| McClellan, Jennifer | D | VA | House | - | - |
| McClintock, Tom | R | CA | House | NO | [2020](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2020/10043558.pdf) |
@@ -361,7 +361,7 @@ NOTE: If you open a link to a Senator's disclosure, you need to paste the URL in
| Molinaro, Marcus | R | NY | House | - | - |
| Moolenaar, John | R | MI | House | NO | [2020](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2020/10042174.pdf) |
| Mooney, Alexander | R | WV | House | NO | [2020](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2020/10043589.pdf) |
| Moore, Barry | R | AL | House | YES | [2020](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2020/10040681.pdf) | $1-15k "crypto currency" |
| Moore, Barry | R | AL | House | NO | [2023](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2023/10057716.pdf) |
| Moore, Blake | R | UT | House | NO | [2020](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2020/10041325.pdf) |
| Moore, Gwen | D | WI | House | NO | [2020](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2020/8218087.pdf) |
| Moran, Jerry | R | KS | Senate | NO | [2020](https://efdsearch.senate.gov/search/view/annual/6cc69cc0-afd4-4328-9b15-cd1ad66cede5/) |
@@ -521,7 +521,7 @@ NOTE: If you open a link to a Senator's disclosure, you need to paste the URL in
| Underwood, Lauren | D | IL | House | NO | [2020](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2020/10041518.pdf) |
| Valadao, David | R | CA | House | NO | [2020](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2020/10042949.pdf) | No assets disclosed |
| Vance, J.D. | R | OH | Senate | YES | [2023](https://efdsearch.senate.gov/search/view/annual/2f2f5bbc-50c5-4b00-acd9-85870f9e349c/) | $100K-$250K BTC |
| Van Drew, Jefferson | R | NJ | House | YES | [2023](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2020/8218181.pdf) | $100K-$250K Grayscale Trust |
| Van Drew, Jefferson | R | NJ | House | YES | [2023](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2020/8218181.pdf) | $100K-$250K GBTC |
| Van Duyne, Beth | R | TX | House | NO | [2020](https://disclosures-clerk.house.gov/public_disc/financial-pdfs/2020/10041645.pdf) |
| Van Hollen, Chris | D | MD | Senate | NO | [2020](https://efdsearch.senate.gov/search/view/paper/cd3c4664-9884-4b80-bed1-f14620a0f77c/) |
| Van Orden, Derrick | R | WI | House | - | - |
67 changes: 53 additions & 14 deletions automated_updates/config.py
@@ -4,36 +4,66 @@
'cryptocurrency',
'blockchain',
'crypto',
'digital asset',
'virtual currency',
# bitcoin
'btc',
'bitcoin',
# etfs
'fbtc ',
'ibit ',
'fbtc',
'ibit',
'arkb',
'bitb',
'grayscale',
'greyscale',
'gbtc',
'bito ',
# bitcoin/crypto proxy companies
'coinbase',
'microstrategy',
'mstr ',
'marathon digital',
'riot blockchain',
'cleanspark',
'core scientific',
'bito',
# miners
'argo blockchain',
'arbk',
'bitdeer',
'btdr',
'bitfarms',
'bitf',
'canaan',
#'can', // too many false positives
'cipher mining',
'cifr',
'cleanspark',
'clsk',
'hive digital technologies',
'\(hive\)',
'hut 8 mining',
'\(hut\)',
'iris energy',
'iren',
'marathon digital',
'\(mara\)',
'riot blockchain',
'\(riot\)',
'terawulf',
'wulf',
# bitcoin/crypto proxy companies
'coinbase',
'\(coin\)',
'core scientific',
'corz',
'galaxy digital',
'glxy',
'microstrategy',
'mstr',
'semler scientific',
'smlr',
'block, Inc',
'block inc',
'\(sq\)',
# shitcoins
'ethereum',
'eth ',
' eth ',
'litecoin',
'ltc ',
'ltc',
'ripple',
'xrp ',
'xrp',
'cardano',
'polkadot',
'chainlink',
@@ -46,7 +76,16 @@

# terms to be explicitly excluded
bitcoin_crypto_terms_false_positives = [
'Armstrong',
'BTC LifePath',
'H&R',
'H & R',
'Marathon Oil',
'Marathon Petroleum',
'Marriott',
'Pershing',
'Quonset',
'Squibb'
]

source_data_dir = './all_source_data/'
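
The matching code that consumes these two lists is not part of this commit, so the following is only a minimal sketch of how they could be applied. It assumes each term is used as a case-insensitive regular expression (which the escaped parentheses in entries such as `'\(hive\)'` suggest) and that false-positive phrases are blanked out before matching. The helper name `find_crypto_terms` is hypothetical, and the lists are a small subset chosen for illustration.

```python
import re

# Illustrative subset of the terms in this diff, not the full config.py list.
# Raw strings keep the escaped parentheses intact for the regex engine.
bitcoin_crypto_terms = ['btc', 'bitcoin', 'gbtc', r'\(hive\)', 'marathon digital']
bitcoin_crypto_terms_false_positives = ['BTC LifePath', 'Marathon Oil', 'Marriott']


def find_crypto_terms(line):
    """Hypothetical helper: return the search terms that match one line of text."""
    cleaned = line
    # Blank out known false positives first so they cannot trigger a match.
    for phrase in bitcoin_crypto_terms_false_positives:
        cleaned = re.sub(re.escape(phrase), ' ', cleaned, flags=re.IGNORECASE)
    # Treat each remaining term as a case-insensitive regular expression.
    return [term for term in bitcoin_crypto_terms
            if re.search(term, cleaned, flags=re.IGNORECASE)]


print(find_crypto_terms('Grayscale Bitcoin Trust (GBTC)'))
print(find_crypto_terms('BTC LifePath Index 2040 Fund'))
print(find_crypto_terms('Archive Storage LLC'))
```

Under these assumptions, a parenthesised ticker pattern such as `\(hive\)` only matches the ticker as it appears in parentheses after a company name, so an unrelated word like "Archive" does not trigger it. Likewise, a fund name containing "BTC" is ignored once 'BTC LifePath' has been blanked out.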