We haven't been updating this, so rather than see a flood of security notices, we have archived the repository. May it still be of value to the community in the future.
PubMed2Knowledge (PM2K) is a tool that allows users to explore relationships between two topics of interest by leveraging PubMed citations indexed with MeSH terms. The user can quickly learn how two topics (MeSH terms) are related across the PubMed citation corpus. For example, suppose I wanted to know "the where" of publications about Zika virus. In such a query, I may choose to begin my analysis by selecting one or more of the relevant MeSH terms, in this case, "Zika Virus" and "Zika Virus Infection", to create my records of interest. Since I would like to know about geographic information among these publications, I may decide to enter "Geographic Locations" as my second input parameter. The tool would analyze the data by quantifying the citations across the first child nodes of "Geographic Locations", which can be explored using the Mesh Browser (for this particular term, the children can be seen here PLEASE NOTE: PubMed citations are not immediately indexed with MeSH terms, so the latest abstracts will not be included in your analysis.
From the National Library of Medicine website: "The Medical Subject Headings (MeSH) thesaurus is a controlled and hierarchically-organized vocabulary produced by the National Library of Medicine. It is used for indexing, cataloging, and searching of biomedical and health-related information." There are plenty of resources for understanding and leveraging MeSH from the dedicated NLM page. The file used in this application to make the matches between the primary and secondary topic of interest is the RDF found on the MeSH Downloads page (downloaded 2019-10-16).
- User supplies two MeSH-based input parameters, the first is the term that generates the "anchor" search, the second is the "exploratory" parameter.
- A PubMed is search executed via
eSearch
from the Biopython module - UIDs are posted to history server via
ePost
- The MEDLINE-formatted citations are retrieved via
eFetch
- A function in parallel extracts all matching MeSH terms based on paramater two, which are then used to filter the corpus set.
- The MeSH terms from the search corpus matching paramater two are counted for their frequency in the original record set.
- The child nodes of the exploratoary parameter are displayed in a bar chart
A user starts a search using a MeSH term of interest. A corpus of citations is retrieved. The number of those citations that intersect (match) the corpus are quantified and displayed in a chart.
The beta web app can be found here: http://pubmeta.us/#/
Using PM2K is really straightforward; enter your "anchor" search MeSH terms in box one, and select the MeSH term of interest in box two you would like to compare/analyze against. Below is an example analysis and output:
This figures illustrates the output for the example proposed above, displaying the regions tagged on the publications that were about Zika Virus.
- The MeSH terms can be found with autocomplete after the user begins to type
- Support for Entry Terms & SCRs
- Date filtering
- Time series analysis
- Support for publication type (PT) search
- Support PMID list upload instead of search
- Send user to PubMed to perform a search based on click on bar in output figure
- Kurtis Haro, PhD NLM/NCBI
- Eric Moyer, NLM/NCBI
- Evgeny Ivanchenko, PhD NLM/NCBI
- Victor Joukov, NLM/NCBI
- Preeti G. Kochar, PhD NLM/LO/BSD
+---------+
| |
| Browser |
| |
+----+----+
|
|
|
v
+----+-----+
| |
| Vue |
| |
| Frontend |
| |
+----+-----+
|
|
v
+----+----+ +----------+
| | | |
| Flask | | Entrez |
| +----->+ |
| Backend | | EUtils |
| | | |
+---+-----+ +----------+
|
v
+---+----------+
| |
| Preprocessed |
| MeSH |
| Datafiles |
| |
| |
+--------------+