Commit 6be1e3e0 authored by Emma Schymanski

Merge branch 'pfas-docs-updates' into 'main'

Updated docs

See merge request !1
parents 650024eb 1d93f1ca
---
title: "PFAS and Fluorinated Organic Compounds in PubChem Tree"
title: "PFAS and Fluorinated Compounds in PubChem Tree"
author:
- "Emma L. Schymanski^1^*, Parviel Chirsir^1^, Todor Kondic^1^,"
- "Paul A. Thiessen^2^, Jian Zhang^2^ and Evan E. Bolton^2^*"
date: "27/03/2022"
date: "02/06/2022"
output: pdf_document
csl: journal-of-cheminformatics.csl
bibliography: refs.bib
......@@ -34,32 +34,33 @@ EEB: [0000-0002-5959-6190](http://orcid.org/0000-0002-5959-6190).
## Preamble
This document describes the "[PFAS and Fluorinated Organic Compounds in
This document describes the "[PFAS and Fluorinated Compounds in
PubChem Tree](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=120)"
(see Figure 1) on the
[Classification Browser](https://pubchem.ncbi.nlm.nih.gov/classification/)
(hereafter
"[PubChem PFAS Tree](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=120)")
<!-- on the -->
<!-- [Classification Browser](https://pubchem.ncbi.nlm.nih.gov/classification/) -->
in [PubChem](https://pubchem.ncbi.nlm.nih.gov/) [@kim_pubchem_2021],
developed in collaboration between PubChem (NCBI/NLM/NIH) and the
developed jointly between PubChem (NCBI/NLM/NIH) and the
Environmental Cheminformatics group
([ECI](https://wwwen.uni.lu/lcsb/research/environmental_cheminformatics))
at the [LCSB](https://wwwen.uni.lu/lcsb/),
[University of Luxembourg](https://wwwen.uni.lu/) in consultation with
[University of Luxembourg](https://wwwen.uni.lu/), in consultation with
several community representatives (see [Contributions](#contrib)
and [Acknowledgements](#ack)).
The
[PFAS Tree](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=120)
(see [Figure 1](##treenodes))
[PubChem PFAS Tree](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=120)
(see [Figure 1](#treenodes) and [Contents listing](#cont))
includes all compounds in [PubChem](https://pubchem.ncbi.nlm.nih.gov/)
that satisfy various definitions, as explained later in this document.
Each compound in PubChem has a PubChem Compound Identifier (CID), and the
satisfying various definitions, as explained later in this document.
Note that each compound in PubChem has a PubChem Compound Identifier (CID), and the
blue numbers next to each node header reflect the number of
compounds (_i.e._ CIDs) in that node.
To become more familiar with the PubChem Classification Browser features
in general before embarking on content specific to the PFAS tree,
see the Section [Navigating the Tree](#search).
There is also extensive documentation on the PubChem website
(links below) or reach out to
More details on the general
[PubChem Classification Browser](https://pubchem.ncbi.nlm.nih.gov/classification/)
features are given in the Section [Navigating the Tree](#search), at
the links below, or by reaching out to
[pubchem-help@ncbi.nlm.nih.gov](mailto:pubchem-help@ncbi.nlm.nih.gov)
for more information:
......@@ -68,46 +69,39 @@ for more information:
- https://pubchem.ncbi.nlm.nih.gov/classification/docs/classification_help.html
## Contents
## Contents {#cont}
<!-- This document is organised into several sections, as follows: -->
Table: _Contents page for this documentation._
Table: _Contents list for the PubChem PFAS Tree documentation._
| Section | Navigation | PDF Page |
|-----------|---------|:----:|
|PubChem PFAS Tree Nodes | [Go to heading](#treenodes) | 2 |
|_OECD PFAS Definition_ | [Go to heading](#oecddef) | 2 |
|_Organofluorine Compounds_ | [Go to heading](#orgf) | 5 |
|_PFAS and Fluorinated Organic Compound Collections_ | [Go to heading](#lists) | 5 |
| - _OECD PFAS Definition_ | [Go to heading](#oecddef) | 2 |
| - _Organofluorine Compounds_ | [Go to heading](#orgf) | 5 |
| - _Other Diverse Fluorinated Compounds_ | [Go to heading](#divf) | 6 |
| - _PFAS and Fluorinated Compound Collections_ | [Go to heading](#lists) | 7 |
|Navigating the Tree | [Go to heading](#search) | 7 |
|_Search via PubChem Search_ | [Go to heading](#pc-search) | 7 |
|_Interactions via Entrez_ | [Go to heading](#entrez) | 9 |
|_Interactions via PUG REST_ | [Go to heading](#pugrest) | 10 |
|Further Details | [Go to heading](#details) | 12 |
|Statements | [Go to heading](#statements) | 12 |
|References | [Go to heading](#refs) | 13 |
<!-- To become more familiar with the PubChem Classification Browser features -->
<!-- in general before embarking on content specific to the PFAS tree, -->
<!-- see Section [Navigating the Tree](#search). -->
| - _Search via PubChem Search_ | [Go to heading](#pc-search) | 8 |
| - _Interactions via Entrez_ | [Go to heading](#entrez) | 9 |
| - _Interactions via PUG REST_ | [Go to heading](#pugrest) | 12 |
|Further Details | [Go to heading](#details) | 13 |
|Statements | [Go to heading](#statements) | 14 |
|References | [Go to heading](#refs) | 14 |
<!-- There is also extensive documentation on the PubChem website, see: -->
<!-- - https://pubchem.ncbi.nlm.nih.gov/classification/ -->
<!-- - https://pubchemdocs.ncbi.nlm.nih.gov/classification-browser -->
<!-- - https://pubchem.ncbi.nlm.nih.gov/classification/docs/classification_help.html -->
## PubChem PFAS Tree Nodes {#treenodes}
The tree is currently split into three main nodes that are constructed and
compiled separately (see Figure 1).
The tree is currently split into four main nodes that are constructed and
compiled separately (see [Figure 1](#treenodes)).
More nodes are under development and will be released as they are ready.
Further details are given below.
<!-- To become more familiar with the PubChem Classification Browser features, -->
<!-- see Section [Navigating the Tree](#search). -->
Further details about each of the nodes are given below.
![_The "[PFAS and Fluorinated Organic Compounds in PubChem Tree](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=120)" Landing Page._](fig/PFAS_Tree_Landing.png)
To become more familiar with the PubChem Classification Browser features,
see Section [Navigating the Tree](#search).
![_The "[PFAS and Fluorinated Compounds in PubChem Tree](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=120)" Landing Page (29 May 2022)._](fig/PFAS_Tree_Landing.png)
......@@ -121,26 +115,25 @@ CF~3~ part) in the 2021 OECD Report
Note that here, "**PFAS part**" is used to describe a connected portion of
the molecule that satisfies the OECD PFAS definition. A given molecule may have
more than one PFAS part present; some examples are given in Figure 2,
along with the count of parts. For more information, see
[Further Details](#details).
along with the count of parts. For more information, see section
"[Further Details](#details)".
Browsing the 6 million entries in this node (see Figure 3) is challenging.
Since most of these PFAS contain isolated CF~2~ (600 K entries) or
CF~3~ groups (5.4 M entries), these were separated into individual sections
<!-- (see "[_isolated CF~2~ and CF~3~_](#isonodes)"). -->
(see [next section](#isonodes)).
~188 K compounds contain PFAS parts larger than CF~2~/CF~3~
(see "[larger PFAS parts](#largerparts)").
(see "[Isolated CF~2~ and CF~3~ Nodes](#isonodes)").
<!-- (see [next section](#isonodes)). -->
Approximately 188 K compounds contain PFAS parts larger than CF~2~/CF~3~
(see "[PFAS Parts Larger than CF~2~/CF~3~](#largerparts)").
<!-- #### PFAS "parts": -->
![_Examples of molecules with varying PFAS parts highlighted, drawn using [CDK Depict](https://www.simolecule.com/cdkdepict/depict.html) [@mayfield_cdk]._](fig/PFAS_parts_CDK.png)
The _OECD PFAS Definition_ node
The _OECD PFAS Definition_ node,
with the top two level subnodes, is shown in Figure 3.
![_The OECD PFAS Definition part of the PFAS tree, with top two subnodes (24 March 2022)._](fig/OECDPFAS_TopTwoSubnodes_v3.png)
![_The OECD PFAS Definition part of the PFAS tree, with top two subnodes (29 May 2022)._](fig/OECDPFAS_TopTwoSubnodes.png)
......@@ -161,12 +154,12 @@ is added (_e.g.,_ Figure 4, bottom left, "_Contains isolated unsaturated-linear
PFAS part_"), if not, a list of the possibilities is given directly
(_e.g.,_ Figure 4, middle left, "_Contains isolated unsaturated-cyclic part_").
The "_Contains only isolated CF~2~_" (or, for the CF~3~ node, only isolated
CF~3~) is broken down by the number of isolated groups (CF~2~ or,
The "_Contains only isolated CF~2~_"
(or, for the CF~3~ node, "_Contains only isolated CF~3~_")
is broken down by the number of isolated groups (CF~2~ or,
for the CF~3~ node, by CF~3~ groups) - see Figure 4, middle panel. In both
cases, the vast majority of molecules have only one isolated group.
The "_Contains only isolated CF~2~/CF~3~_" is also broken down by
The "_Contains only isolated CF~2~/CF~3~_" node is also broken down by
the number of groups, sorted by increasing number of CF~2~ groups
(for both nodes). See Figure 4, right panel.
......@@ -176,8 +169,8 @@ the number of groups, sorted by increasing number of CF~2~ groups
The "_Molecule contains PFAS parts larger than CF~2~/CF~3~_" part of the
OECD PFAS node includes about 188 K molecules, which can be browsed
in two major breakdowns, by isolated PFAS part count (see Figure 5)
and by isolated PFAS part type (see Figure 6).
in two major breakdowns, by _isolated PFAS part count_ (see Figure 5)
and by _isolated PFAS part type_ (see Figure 6).
This section of the tree is constructed dynamically - in other words,
the subnodes present depend on the contents within - to prevent
excessive scrolling.
......@@ -207,7 +200,7 @@ ensure logical sorting.
#### The _Breakdown by isolated PFAS part type_
is first broken down by the
part type (linear, cyclic, _etc._) (_e.g.,_ Figure 6, left panel). These are
part type (linear, cyclic, _etc._) as shown in Figure 6, left panel. These are
again split dynamically. With fewer than 20 entries, the list split
according to PFAS part formulas appears. If a greater breakdown is needed,
an extra layer of "_Also contains ..._" or "_Only contains ..._" is
......@@ -237,7 +230,7 @@ a [MetFrag](https://msbi.ipb-halle.de/MetFrag/)
(DOI: [10.5281/zenodo.6385954](https://doi.org/10.5281/zenodo.6385954))
for use in
[MetFragCL](https://ipb-halle.github.io/MetFrag/projects/metfragcl/)
and will be made available from the
and is available from the
[MetFragWeb](https://msbi.ipb-halle.de/MetFrag/)
dropdown menu. This file contains several useful fields
from the [Download](#pc-search) file as well as Patent and Literature
......@@ -277,27 +270,59 @@ The exact mass subcategories are split into the ranges
1-250, 250-500, 500-750, 750-1000 and >1000 - and are only present
if there are CIDs within this range.
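
The range logic above can be sketched in a few lines of Python. This is an illustration only, not the code used to build the tree: the function names are hypothetical, and since the text does not say which range a boundary value (e.g. exactly 250) falls into, the sketch assumes it belongs to the lower range.

```python
from bisect import bisect_left
from collections import Counter

# Upper bounds of the bounded ranges named in the text, plus the open-ended one.
UPPER_BOUNDS = [250, 500, 750, 1000]
LABELS = ["1-250", "250-500", "500-750", "750-1000", ">1000"]


def exact_mass_range(exact_mass: float) -> str:
    """Return the range label for an exact mass.

    ASSUMPTION: a value equal to a boundary is placed in the lower range;
    the documentation does not specify this.
    """
    return LABELS[bisect_left(UPPER_BOUNDS, exact_mass)]


def populated_ranges(masses):
    """List (label, count) pairs, keeping only ranges that contain entries,
    mirroring the 'only present if there are CIDs within this range' rule."""
    counts = Counter(exact_mass_range(m) for m in masses)
    return [(label, counts[label]) for label in LABELS if counts[label] > 0]


if __name__ == "__main__":
    print(populated_ranges([100.0, 413.97, 450.2, 1200.0]))
```

Empty ranges are dropped from the result, so a node with no compounds between 500 and 1000 would show only the remaining subcategories.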
<!-- The exact mass is split as follows: -->
<!-- - Exact mass range 1-250 -->
<!-- - Exact mass range 250-500 -->
<!-- - Exact mass range 500-750 -->
<!-- - Exact mass range 750-1000 -->
<!-- - Exact mass range >1000 -->
### Other Diverse Fluorinated Compounds {#divf}
The "_Other Diverse Fluorinated Compounds_" section of the
[PubChem PFAS Tree](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=120)
is designed to help users explore various
cases of fluorine chemistry that are not necessarily covered in the
[OECD PFAS](#oecddef)
or [Organofluorine compound](#orgf) sections above. The navigation in this
section helps explore fluorinated compound chemistry by various
fluorine-heteroatom bonds and the occurrence of different elements
(see Figure 8).
Many of the compounds present in this section are also present
in the other sections of the
[PubChem PFAS Tree](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=120).
The overlap can be investigated in Entrez (see section
[Interactions via Entrez](#entrez) below).
![_The "Other diverse fluorinated compounds" part of the PubChem PFAS Tree, showing the breakdown by fluorine bonded to non-carbon elements and by non-organic element (numbers from 29 May 2022)._](fig/DiverseFcmpds.png)
#### The _Contains fluorine bond to non-carbon element_
section (Figure 8, middle panel) is broken down first by the
count of molecules present in the given category, then by the
non-carbon element present in the F-element bond (sorted alphabetically).
For the sections with counts above 100, there is an extra breakdown
by the numbers of fluorine present overall.
#### The _Contains non-organic element_
section (Figure 8, right panel) is likewise broken down first by the
count of molecules present in the given category, then by the
non-organic element present (sorted alphabetically).
In this section, non-organic refers to any element that is not
C, H, N, O, P, S, Si, F, Cl, Br or I.
As above, there is an extra breakdown by the numbers of fluorine
present overall for the sections with counts above 100.
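
As a rough illustration of the "non-organic" definition given above, the following Python sketch extracts element symbols from a molecular formula string and reports those outside the listed set. The function name and the regular-expression parsing are assumptions made for this example; the tree itself is built from PubChem structures, not from formula strings.

```python
import re

# Elements treated as "organic" in this section of the tree (from the text above).
ORGANIC_ELEMENTS = frozenset(
    {"C", "H", "N", "O", "P", "S", "Si", "F", "Cl", "Br", "I"}
)


def non_organic_elements(formula: str) -> set:
    """Return the element symbols in `formula` that are not in ORGANIC_ELEMENTS.

    ASSUMPTION: `formula` is a plain molecular formula such as "C8HF15O2",
    where each element symbol is one uppercase letter optionally followed
    by one lowercase letter.
    """
    symbols = set(re.findall(r"[A-Z][a-z]?", formula))
    return symbols - ORGANIC_ELEMENTS


if __name__ == "__main__":
    for formula in ("C8HF15O2", "BF3", "F6U", "C2H6F2Si"):
        print(formula, sorted(non_organic_elements(formula)))
```

A formula with an empty result contains only elements from the list above, so it would not qualify for the "_Contains non-organic element_" section under this definition.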
### PFAS and Fluorinated Organic Compound Collections {#lists}
The "_PFAS and Fluorinated Organic Compound Collections_"
section of the PFAS tree contains various lists gathered
across PubChem content (see Figure 8). Additional community-based PFAS lists may
also be added here. The mapping files to construct this are kept
### PFAS and Fluorinated Compound Collections {#lists}
The "_PFAS and Fluorinated Compound Collections_"
section of the PubChem PFAS tree contains various lists gathered
across PubChem content (see Figure 9).
The mapping files to construct this are kept
on the [eci/pubchem](https://gitlab.lcsb.uni.lu/eci/pubchem/)
repository on GitLab.
![_The "PFAS and Fluorinated Organic Compound Collections" node, with all major collections shown (CompTox as inset). Numbers and content listing from 24 March 2022._](fig/PFAS_list_of_lists.png)
![_The "PFAS and Fluorinated Compound Collections" node, with all major collections shown (CompTox and OntoChem as insets). Numbers and content listing from 29 May 2022._](fig/PFAS_list_of_lists.png)
Currently, the content displayed in Figure 8 comes from:
Currently, the content displayed in Figure 9 comes from:
- All [PFAS lists](https://comptox.epa.gov/dashboard/chemical-lists?filtered=&search=PFAS)
from the
......@@ -310,13 +335,20 @@ from the NORMAN Suspect List Exchange
([NORMAN-SLE](https://www.norman-network.com/nds/SLE/)) via the
[NORMAN-SLE Tree](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=101)
in PubChem;
- The CORE PFAS lists from OntoChem [@barnabas_extracting_2022];
- The CORE and Patent PFAS lists from OntoChem [@barnabas_extracting_2022];
- Other collections from within PubChem Classification Trees, including
collections from
[Cameo](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=86),
[ChEBI](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=2) and
[MeSH](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=1).
Additional community-based PFAS lists can also be added to this section.
We will be happy to add new collections where feasible.
If you have any suggestions, please email
[pubchem-help@ncbi.nlm.nih.gov](mailto:pubchem-help@ncbi.nlm.nih.gov) or
[normansle@uni.lu](mailto:normansle@uni.lu) for further details.
## Navigating the Tree {#search}
......@@ -328,20 +360,20 @@ sections.
### Search via PubChem Search {#pc-search}
Perhaps the most intuitive interaction is directly through
clicking on the numbers beside each node (see Figure 9). This sends a query
clicking on the numbers beside each node (see Figure 10). This sends a query
directly to the PubChem Search interface and displays the
entire node contents, as shown in Figure 9. This query follows
entire node contents, as shown in Figure 10. This query follows
"_OECD PFAS Definition_" > "_Molecule contains PFAS parts larger than
CF~2~/CF~3~_" > "_Breakdown by isolated PFAS part count_" >
"_Contains 01 isolated PFAS part_" > "_Count of molecules 10001-100000_" >
"_Contains 01xC04F09-linear_" and returns the 10,555 CIDs containing
only one single linear C~4~F~9~ PFAS part.
This query can then be downloaded (Figure 9, inset),
"_Contains 01xC04F09-linear_" and returns the 10,555 CIDs (26 March, 2022)
containing only one single linear C~4~F~9~ PFAS part.
This query can then be downloaded (Figure 10, inset),
or sent to Entrez for advanced querying (see [next section](#entrez)).
Note that clicking on the "**?**" beside a node (where present) will open a
tool tip explaining the node contents (Figure 9, bottom left).
tool tip explaining the node contents (Figure 10, bottom left).
![_Querying node contents in PubChem Search. When clicking on the blue numbers (left), a search window will open in a new tab (right, main image). This collection can be browsed, downloaded (see inset) or sent to Entrez (see next section). Clicking on the "**?**" sign next to a node name will open a tool tip (left panel, bottom, see yellow blurb)._](fig/Tree_PubChemSearch.png)
![_Querying node contents in PubChem Search. When clicking on the blue numbers (left), a search window will open in a new tab (right, main image). This collection can be browsed, downloaded (see inset) or sent to Entrez (see next section). Clicking on the "**?**" sign next to a node name will open a tool tip (left panel, bottom, see yellow blurb). Image from 26 March 2022._](fig/Tree_PubChemSearch.png)
The download file contains a number of fields of interest,
including: CIDs, names and synonyms, several properties (_e.g._ XlogP),
......@@ -350,7 +382,7 @@ as well as several metadata entries. These metadata entries contain
valuable information about the evidence contributing to the presence
of that structure in PubChem (_e.g.,_ contribution source(s) and date,
annotation information). Relevant fields are explained in Table 2
and shown in Figure 10.
and shown in Figure 11.
Table: _Relevant metadata files in the PubChem Download files._
......@@ -366,7 +398,7 @@ Table: _Relevant metadata files in the PubChem Download files._
<!-- cid cmpdname cmpdsynonym mw mf polararea complexity xlogp heavycnt hbonddonor hbondacc rotbonds inchi isosmiles inchikey iupacname meshheadings annothits annothitcnt aids cidcdate sidsrcname depcatg annotation -->
![_PubChem Download file. Top left: CID, names, properties. Middle: structural information and metadata. Bottom: selected metadata with expanded view to show the information content of records. Downloaded from the query shown in Figure 9 on 27 March 2022._](fig/PubChem_Download_File.png)
![_PubChem Download file. Top left: CID, names, properties. Middle: structural information and metadata. Bottom: selected metadata with expanded view to show the information content of records. Downloaded from the query shown in Figure 10 on 27 March 2022._](fig/PubChem_Download_File.png)
Note that the categories visible in the "_annothits_" column align
with the individual sections in PubChem records and can
......@@ -384,7 +416,7 @@ in Figure 10 can be viewed with the following hyperlinks:
- https://pubchem.ncbi.nlm.nih.gov/compound/105447#section=Safety-and-Hazards
- https://pubchem.ncbi.nlm.nih.gov/compound/105447#section=Use-and-Manufacturing
As visible in the figure, there are many records where the information
As visible in Figure 11, there are many records where the information
has only been extracted from patents, or for which no annotation exists.
Thus, this metadata can help add a lot of context to the relevance of the
entries for the particular question at hand.
......@@ -397,22 +429,22 @@ as explained in the next section.
It is possible to build more extensive queries via the
[Entrez](https://pubchemdocs.ncbi.nlm.nih.gov/advanced-search-entrez)
interface, which is accessible through the button below
the download button (see Figure 9) or by clicking the "Use Entrez"
the download button (see Figure 10) or by clicking the "Use Entrez"
option on the PubChem landing page. More documentation on Entrez is given
[here](https://pubchemdocs.ncbi.nlm.nih.gov/advanced-search-entrez).
This section steps through a few interactive examples.
#### Example 1: Find all PFAS containing one linear C~4~F~9~ part with use information:
To find all molecules from the query in Figure 9 that also have
To find all molecules from the query in Figure 10 that also have
use information in PubChem, the first step is to send the 10,555 CIDs
from the query above to Entrez via the "Push to Entrez" option (Figure 9,
from the query above to Entrez via the "Push to Entrez" option (Figure 10,
second box encircled in red on the right). This opens a new page in the
Entrez interface (not shown).
Next, go to the "Use and Manufacturing" section of the
[PubChem TOC Tree](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=72),
send this to PubChem Search via the numbers next to the node (Figure 11,
red circle on left), and push to Entrez (Figure 11, top right). By
selecting the "Advanced" option under the search bar (Figure 11, top),
send this to PubChem Search via the numbers next to the node (Figure 12,
red circle on left), and push to Entrez (Figure 12, top right). By
selecting the "Advanced" option under the search bar (Figure 12, top),
the Advanced Search builder is opened and further queries can be built.
By selecting "#2 AND #6", only the 436 chemicals with a single
C~4~F~9~ linear PFAS part (query #2) that also have use and manufacturing
......@@ -426,25 +458,25 @@ Analytical chemists may, for instance, be particularly keen on finding
out which PFAS or organofluorine compounds have mass spectrometry information
available in PubChem (or in resources integrated within PubChem). It is
also possible to use the Entrez functionality to subset the tree
contents according to other available information - shown in Figure 12
contents according to other available information - shown in Figure 13
for this example. First, go to the "Mass Spectrometry" section of the
[PubChem TOC Tree](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=72),
which is under the "Spectral Information" heading, and send this query
to Entrez (see Figure 12 left and top right). Then, go back to the
to Entrez (see Figure 13 left and top right). Then, go back to the
[PubChem PFAS Tree](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=120)
and ***refresh*** the contents. A new dropdown menu will appear
(if not already present) called "Filter by Entrez History" (Figure 12,
(if not already present) called "Filter by Entrez History" (Figure 13,
bottom right). By selecting the chosen query in this dropdown menu,
the tree will then be subset by the contents within that query, such
that only CIDs that are in the tree _and_ in the query will show
(here, ~54K not 19M CIDs).
(in Figure 13, ~54K not 19M CIDs).
The same holds for any advanced query, so it would be possible to
_e.g._ do a subset of only mass spectra that occur in
[MassBank EU](https://massbank.eu/MassBank/) or NIST by additionally
adding the relevant "_Information Sources_" (from the
adding the relevant "Information Sources" (from the
[PubChem TOC Tree](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=72))
to the Entrez query. Since large queries such as the Mass Spectrometry
to the Entrez query. Since large queries such as the "Mass Spectrometry"
category, or advanced AND/OR combinations can end up quite complicated,
it is a good idea to carefully note the query number (#XXX) and the
number of compounds in the result, to ensure the correct entries are
......@@ -454,12 +486,17 @@ Also note that it is possible to send queries to Entrez via the
[PubChem Identifier Exchange Service](https://pubchem.ncbi.nlm.nih.gov/idexchange/idexchange.cgi).
Thus, it is possible to add external queries to Entrez history
by uploading this information via the
[ID Exchange](https://pubchem.ncbi.nlm.nih.gov/idexchange/idexchange.cgi).
[ID Exchange](https://pubchem.ncbi.nlm.nih.gov/idexchange/idexchange.cgi),
as shown in Figure 14.
![_Subsetting Tree Contents via Entrez. Left: [PubChem TOC Tree](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=72), "Mass Spectrometry" subsection. Top right: the "Mass Spectrometry" query in PubChem Search (to be sent to Entrez). Bottom right: the [PubChem PFAS Tree](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=120) subset by Mass Spectrometry, now only displaying CIDs where mass spectrometry information is available in PubChem. Queries run on 27 March 2022._](fig/Entrez_MSandPFAS.png)
More examples coming soon ...
<!-- More examples coming soon ... -->
![_Sending queries to Entrez via the [PubChem ID Exchange](https://pubchem.ncbi.nlm.nih.gov/idexchange/idexchange.cgi)._](fig/IDExch_to_Entrez.png)
### Interactions via PUG REST {#pugrest}
......@@ -474,6 +511,8 @@ please see the following locations in the PubChem documentation:
- https://pubchemdocs.ncbi.nlm.nih.gov/pug-rest$classification_nodes
#### Interacting with the PubChem PFAS Tree in R:
The following contains a few tips to start interacting with the tree in R;
note that some of these features are also in active development.
......@@ -543,21 +582,24 @@ Nonetheless, some
technical details are necessary and are contained in this section, which
will be expanded as further questions arise.
### Compounds Excluded from the PubChem PFAS Tree
The [PubChem PFAS Tree](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=120)
currently excludes molecules (compounds) from consideration if they:
- are a mixture (i.e., has multiple components, which includes any salts);
- are a mixture (i.e., contain multiple components, including any salts);
- contain a radical or isotopically labelled atom.
Since the entire tree is constructed on CIDs (_i.e._, compounds), substance
entries (denoted by substance identifiers, SID) are also not included. Thus,
undefined or poorly defined entities are also not included.
undefined or poorly defined entities are also not included. Polymer entries
are also not included.
More information about the difference between compounds and substances
on PubChem is available
[here](https://pubchemblog.ncbi.nlm.nih.gov/2014/06/19/what-is-the-difference-between-a-substance-and-a-compound-in-pubchem/).
#### PFAS Test set:
### PFAS Test set
A test set of PFAS and non-PFAS from the OECD Report
[@oecd_reconciling_2021] has been compiled to check the
performance of the
......@@ -568,15 +610,16 @@ requested (and if reasonably possible).
### Future plans
The current approach still has room for improvement; the following are being addressed
The [PubChem PFAS Tree](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=120)
is undergoing active development.
The following are being addressed
in future developments (and will be released as ready):
- Handling of ethers and other connecting atoms;
- Handling of unsaturated PFAS;
- Better browseability of special cases.
- Handling of unsaturated PFAS.
### Contact Details
## Contact Details
User feedback is extremely valuable to help improve this tree further.
Please reach out to either contact author (details on first page,
......@@ -584,13 +627,19 @@ or email [Evan](mailto:evan.bolton@nih.gov) and
[Emma](mailto:emma.schymanski@uni.lu) directly)
with feedback and comments!
If you have any suggestions for PFAS or fluorinated compound collections to
include in the "_PFAS and Fluorinated Compound Collections_" section of the
[PubChem PFAS Tree](https://pubchem.ncbi.nlm.nih.gov/classification/#hid=120),
please contact us at
[pubchem-help@ncbi.nlm.nih.gov](mailto:pubchem-help@ncbi.nlm.nih.gov) or
[normansle@uni.lu](mailto:normansle@uni.lu).
For general questions about PubChem and the functionality
described here, please reach out to the
[PubChem Help](mailto:pubchem-help@ncbi.nlm.nih.gov)
mailing list for further support.
[PubChem Help mailing list](mailto:pubchem-help@ncbi.nlm.nih.gov)
for further support.
<!-- ## Closing -->
## Statements {#statements}
......
"hid","SourceName","SourceID","HNID","nCIDs","nodeNames","nodeHNID","nodeIDs","parentIDs","node_nCIDs","REST_URL"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"OECD PFAS definition",5517102,"node_1","root",6096212,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5517102/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Molecule contains isolated CF2",5521752,"node_2","node_1",601930,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5521752/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Contains CF2 and larger PFAS parts",5516635,"node_3","node_2",12284,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5516635/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Contains only isolated CF2",5519454,"node_1062","node_2",518285,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5519454/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Contains only isolated CF2/CF3",5524746,"node_1076","node_2",71361,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5524746/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Molecule contains isolated CF3",5523183,"node_1147","node_1",5414868,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5523183/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Contains CF3 and larger PFAS parts",5518208,"node_1219","node_1147",25025,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5518208/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Contains only isolated CF2/CF3",5517629,"node_1148","node_1147",71361,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5517629/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Contains only isolated CF3",5520213,"node_2389","node_1147",5318482,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5520213/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Molecule contains PFAS parts larger than CF2/CF3",5525061,"node_2420","node_1",188084,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5525061/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Breakdown by isolated PFAS part count",5520278,"node_2421","node_2420",188084,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5520278/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Breakdown by isolated PFAS part type",5524707,"node_5660","node_2420",188084,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5524707/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Organofluorine compounds",5523075,"node_8982","root",19080012,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5523075/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Fluorinated aliphatic substances",5524117,"node_9398","node_8982",820900,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5524117/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Fluorinated aliphatic substances that have a fully fluorinated methyl or methylene carbon atom",5520826,"node_9486","node_9398",536283,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5520826/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Other fluorinated aliphatic substances that do NOT have a fully fluorinated methyl or methylene carbon atom",5520510,"node_9399","node_9398",284617,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5520510/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Fluorinated aromatic substances",5521756,"node_8983","node_8982",18258541,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5521756/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"(Non-)Fluorinated aromatic ring(s) with fluorinated aliphatic side chain(s) that do NOT have a fully fluorinated methyl or methylene carbon atom",5524683,"node_9311","node_8983",1441556,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5524683/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Fluorinated aromatic ring(s) with fluorinated aliphatic side chain(s) that have a fully fluorinated methyl or methylene carbon atom",5520571,"node_9234","node_8983",818299,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5520571/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Fluorinated aromatic ring(s) with non-fluorinated aliphatic side chain(s)",5516437,"node_8984","node_8983",11311085,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5516437/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Fluorinated aromatic substances without a side chain",5519611,"node_9151","node_8983",34597,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5519611/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Non-fluorinated aromatic ring(s) with fluorinated aliphatic side chain(s) that have fully fluorinated methyl or methylene carbon atom",5519122,"node_9069","node_8983",4653004,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5519122/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Other fluorinated substances",5525297,"node_9571","node_8982",571,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5525297/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Contains 1 Fluorine atom",5523988,"node_9587","node_9571",370,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5523988/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Contains 2 Fluorine atoms",5520669,"node_9578","node_9571",112,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5520669/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Contains 3 Fluorine atoms",5519498,"node_9575","node_9571",40,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5519498/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Contains 4 Fluorine atoms",5518047,"node_9572","node_9571",24,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5518047/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Contains 5 Fluorine atoms",5522125,"node_9584","node_9571",12,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5522125/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Contains 6 Fluorine atoms",5524803,"node_9593","node_9571",11,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5524803/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Contains 7 Fluorine atoms",5525169,"node_9597","node_9571",2,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5525169/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"PFAS and Fluorinated Organic Compound Collections",5518087,"node_8923","root",36235,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5518087/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"CompTox Chemicals Dashboard PFAS Suspect Lists",5519025,"node_8937","node_8923",8498,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5519025/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[EPAPFAS75S1] PFAS|EPA: List of 75 Test Samples (Set 1)",5516407,"node_8938","node_8937",73,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5516407/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[EPAPFAS75S2] PFAS|EPA: List of 75 Test Samples (Set 2)",5520863,"node_8958","node_8937",75,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5520863/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[EPAPFASCAT] PFAS|EPA Structure-based Categories",5516769,"node_8939","node_8937",81,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5516769/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[EPAPFASDW537] PFAS|EPA|WATER: Existing EPA DW Method 537.1",5521910,"node_8963","node_8937",18,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5521910/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[EPAPFASDW] PFAS|EPA: New EPA Method Drinking Water",5522971,"node_8967","node_8937",25,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5522971/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[EPAPFASDWTREAT] PFAS|EPA|WATER: Drinking Water Treatment Technology",5518051,"node_8945","node_8937",8,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5518051/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[EPAPFASINSOL] PFAS|EPA: Chemical Inventory Insoluble in DMSO",5521524,"node_8961","node_8937",42,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5521524/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[EPAPFASINV] PFAS|EPA: ToxCast Chemical Inventory",5517742,"node_8943","node_8937",427,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5517742/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[EPAPFASINVIVO] PFAS|EPA: In Vivo Studies Available",5517886,"node_8944","node_8937",22,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5517886/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[EPAPFASLITSEARCH] PFAS|EPA: Literature Search Completed",5521727,"node_8962","node_8937",22,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5521727/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[EPAPFASNONDW] PFAS|EPA: New EPA Method Non-Drinking Water",5524299,"node_8969","node_8937",23,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5524299/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[EPAPFASRESEARCH] PFAS|EPA: EPA PFAS Research List",5524835,"node_8972","node_8937",164,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5524835/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[EPAPFASRL] PFAS|EPA: Cross-Agency Research List",5518845,"node_8950","node_8937",192,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5518845/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[EPAPFASTOX] PFAS|EPA: Toxicity Assessments",5516853,"node_8941","node_8937",8,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5516853/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[EPAPFASVALDW] PFAS|EPA|WATER: PFAS with Validated EPA Drinking Water Methods",5524434,"node_8970","node_8937",30,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5524434/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASDEV1] PFAS|EPA PFAS chemicals without explicit structures",5518472,"node_8948","node_8937",44,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5518472/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASGLUEGE] PFAS|NORMAN: Overview of PFAS Uses from Gluege et al (2020)",5519721,"node_8954","node_8937",482,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5519721/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASINVITRO] PFAS|EPA: List of chemicals tested in in vitro methods 2019-2020",5519243,"node_8953","node_8937",181,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5519243/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASKEMI] PFAS: List from the Swedish Chemicals Agency (KEMI) Report",5523875,"node_8968","node_8937",1472,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5523875/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASLCMSGCMS] PFAS: Collection of GC-MS and LC-MS standards: Food Contact Materials",5521221,"node_8960","node_8937",37,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5521221/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASMASTER] PFAS Master List of PFAS Substances (Version 2)",5518495,"node_8949","node_8937",8116,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5518495/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASNORDIC] PFAS: Nordic PFAS Report 2019",5522465,"node_8964","node_8937",202,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5522465/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASNTREV19] PFAS: PFAS in Non-Target HRMS Studies (Liu et al 2019)",5522601,"node_8966","node_8937",126,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5522601/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASOECD] PFAS: Listed in OECD Global Database",5518176,"node_8946","node_8937",3701,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5518176/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASOECDNA] NORMAN: List of PFAS from the OECD Curated by Nikiforos Alygizakis",5519841,"node_8955","node_8937",3205,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5519841/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASPACKAGING] PFAS|EPA PFAS Substances in Pesticide Packaging",5520210,"node_8956","node_8937",7,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5520210/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASSTRUCT] Navigation Panel to PFAS Structure Lists",5524542,"node_8971","node_8937",8078,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5524542/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASSTRUCTV1] PFAS|EPA: PFAS structures in DSSTox (update March 2018)",5520485,"node_8957","node_8937",4350,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5520485/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASSTRUCTV2] PFAS|EPA: PFAS structures in DSSTox (update November 2019)",5519088,"node_8952","node_8937",6624,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5519088/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASSTRUCTV3] PFAS|EPA: PFAS structures in DSSTox (update August 2020)",5516782,"node_8940","node_8937",8136,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5516782/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASSTRUCTv4] PFAS|EPA: PFAS structures in DSSTox (update August 2021)",5518869,"node_8951","node_8937",8078,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5518869/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASTDB] WATER|PFAS: PFAS Chemicals contained in the EPA Drinking Water Treatability Database",5520989,"node_8959","node_8937",37,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5520989/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASTOXDB] PFAS: PFAS-Tox Database",5525323,"node_8973","node_8937",42,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5525323/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASTRI] PFAS: PFAS to the Toxics Release Inventory (TRI) Program by the National Defense Authorization Act",5517389,"node_8942","node_8937",97,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5517389/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PFASTRIER] PFAS Community-Compiled List (Trier et al. 2015)",5522560,"node_8965","node_8937",588,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5522560/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"[PRORISKPFAS] NORMAN|List of PFAS Compiled from NORMAN-SusDat",5518201,"node_8947","node_8937",3371,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5518201/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"NORMAN-SLE PFAS Suspect Lists",5517745,"node_8928","node_8923",5884,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5517745/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"S09 | PFASTRIER | PFAS Suspect List of fluorinated substances from X. Trier and colleagues",5523688,"node_8934","node_8928",468,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5523688/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"S14 | KEMIPFAS | PFAS Highly Fluorinated Substances List from KEMI",5524111,"node_8936","node_8928",1344,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5524111/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"S25 | OECDPFAS | List of PFAS from the OECD",5522807,"node_8932","node_8928",3692,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5522807/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"S46 | PFASNTREV19 | List of PFAS reported in Non-Target HRMS Studies from Liu et al 2019",5523394,"node_8933","node_8928",680,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5523394/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"S80 | PFASGLUEGE | Overview of PFAS Uses",5522532,"node_8931","node_8928",1250,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5522532/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"S89 | PRORISKPFAS | List of PFAS Compiled from NORMAN SusDat",5516725,"node_8929","node_8928",4240,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5516725/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"S92 | FLUOROPHARMA | List of 340 ATC classified fluoro-pharmaceuticals",5520737,"node_8930","node_8928",290,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5520737/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"S94 | FLUOROPEST | List of 423 FRAC/HRAC/IRAC classified fluoro-agrochemicals",5523938,"node_8935","node_8928",318,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5523938/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"OntoChem PFAS Lists",5517067,"node_8924","node_8923",26805,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5517067/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"OntoChem PFAS from CORE - Definition A",5522450,"node_8926","node_8924",26805,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5522450/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"OntoChem PFAS from CORE - Definition B",5524740,"node_8927","node_8924",4114,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5524740/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"OntoChem PFAS from CORE - Definition C",5521474,"node_8925","node_8924",3432,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5521474/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"Other Organic Fluorinated Chemical Content in PubChem",5524741,"node_8974","node_8923",1674,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5524741/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"MeSH: Fluorinated Hydrocarbons",5517545,"node_8975","node_8974",295,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5517545/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"CAMEO Chemicals: Fluorinated Organic Compounds",5521039,"node_8981","node_8974",120,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5521039/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19156786,"ChEBI: Organofluorine Compound",5519872,"node_8980","node_8974",1372,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5519872/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"OECD PFAS definition",5517102,"node_1","root",6125571,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5517102/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Molecule contains isolated CF2",5521752,"node_2","node_1",606617,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5521752/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Contains CF2 and larger PFAS parts",5516635,"node_3","node_2",12656,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5516635/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Contains only isolated CF2",5519454,"node_1063","node_2",522138,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5519454/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Contains only isolated CF2/CF3",5524746,"node_1077","node_2",71823,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5524746/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Molecule contains isolated CF3",5523183,"node_1148","node_1",5439708,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5523183/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Contains CF3 and larger PFAS parts",5518208,"node_1220","node_1148",25685,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5518208/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Contains only isolated CF2/CF3",5517629,"node_1149","node_1148",71823,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5517629/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Contains only isolated CF3",5520213,"node_2391","node_1148",5342200,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5520213/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Molecule contains PFAS parts larger than CF2/CF3",5525061,"node_2422","node_1",189410,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5525061/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Breakdown by isolated PFAS part count",5520278,"node_2423","node_2422",189410,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5520278/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Breakdown by isolated PFAS part type",5524707,"node_5677","node_2422",189410,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5524707/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Organofluorine compounds",5523075,"node_8955","root",19355625,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5523075/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Fluorinated aliphatic substances",5524117,"node_9372","node_8955",834808,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5524117/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Fluorinated aliphatic substances that have a fully fluorinated methyl or methylene carbon atom",5520826,"node_9459","node_9372",556208,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5520826/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Other fluorinated aliphatic substances that do NOT have a fully fluorinated methyl or methylene carbon atom",5520510,"node_9373","node_9372",278600,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5520510/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Fluorinated aromatic substances",5521756,"node_8956","node_8955",18458181,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5521756/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"(Non-)Fluorinated aromatic ring(s) with fluorinated aliphatic side chain(s) that do NOT have a fully fluorinated methyl or methylene carbon atom",5524683,"node_9286","node_8956",1427661,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5524683/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Fluorinated aromatic ring(s) with fluorinated aliphatic side chain(s) that have a fully fluorinated methyl or methylene carbon atom",5520571,"node_9209","node_8956",831061,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5520571/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Fluorinated aromatic ring(s) with non-fluorinated aliphatic side chain(s)",5516437,"node_8957","node_8956",11442983,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5516437/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Fluorinated aromatic substances without a side chain",5519611,"node_9126","node_8956",34788,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5519611/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Non-fluorinated aromatic ring(s) with fluorinated aliphatic side chain(s) that have fully fluorinated methyl or methylene carbon atom",5519122,"node_9044","node_8956",4721688,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5519122/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Other fluorinated substances",5525297,"node_9544","node_8955",84349,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5525297/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Contains a F-Br bond",5542848,"node_9923","node_9544",106,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5542848/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Contains a F-Cl bond",5543043,"node_9947","node_9544",253,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5543043/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Contains a F-I bond",5542546,"node_9872","node_9544",489,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5542546/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Contains a F-N bond",5542017,"node_9614","node_9544",20875,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5542017/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Contains a F-O bond",5542026,"node_9700","node_9544",10884,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5542026/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Contains a F-P bond",5543145,"node_9988","node_9544",7268,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5543145/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Contains a F-S bond",5542136,"node_9782","node_9544",32288,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5542136/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Contains a F-Si bond",5541978,"node_9545","node_9544",12557,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5541978/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Other diverse fluorinated compounds",5541979,"node_10067","root",109077,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5541979/cids/TXT"
120,"PubChem","pfas5_pubchem_tree",5516029,19504874,"Contains fluorine bond to non-carbon element",5542175,"node_10452","node_10067",23155,"https://pubchem.ncbi.nlm.nih.gov/rest/pug/classification/hnid/5542175/cids/TXT"