-
Notifications
You must be signed in to change notification settings - Fork 0
08. PubChemの利用方法 コマンドライン
-
Macのターミナルのアプリが開きます。
-
curl wttr.in/Kumamoto
と入力し、リターンして、以下のようなデータが取得できれば、curl
が利用できます。
bash-3.2$ curl wttr.in/Kumamoto
Weather report: Kumamoto
\ / Clear
.-. 19 °C
― ( ) ― ← 8 km/h
`-’ 10 km
/ \ 0.0 mm
┌─────────────┐
┌──────────────────────────────┬───────────────────────┤ Thu 22 Sep ├───────────────────────┬──────────────────────────────┐
│ Morning │ Noon └──────┬──────┘ Evening │ Night │
├──────────────────────────────┼──────────────────────────────┼──────────────────────────────┼──────────────────────────────┤
│ \ / Sunny │ \ / Sunny │ \ / Partly cloudy │ \ / Partly cloudy │
│ .-. +23(25) °C │ .-. +29(31) °C │ _ /"".-. +25(26) °C │ _ /"".-. +23(25) °C │
│ ― ( ) ― ↓ 5-6 km/h │ ― ( ) ― ↘ 8-9 km/h │ \_( ). ↘ 1-2 km/h │ \_( ). ↘ 3-6 km/h │
│ `-’ 10 km │ `-’ 10 km │ /(___(__) 10 km │ /(___(__) 10 km │
│ / \ 0.0 mm | 0% │ / \ 0.0 mm | 0% │ 0.0 mm | 0% │ 0.0 mm | 0% │
└──────────────────────────────┴──────────────────────────────┴──────────────────────────────┴──────────────────────────────┘
┌─────────────┐
┌──────────────────────────────┬───────────────────────┤ Fri 23 Sep ├───────────────────────┬──────────────────────────────┐
│ Morning │ Noon └──────┬──────┘ Evening │ Night │
├──────────────────────────────┼──────────────────────────────┼──────────────────────────────┼──────────────────────────────┤
│ \ / Sunny │ \ / Sunny │ \ / Sunny │ \ / Clear │
│ .-. +24(26) °C │ .-. +29(32) °C │ .-. +25(27) °C │ .-. +22(25) °C │
│ ― ( ) ― ↘ 4-5 km/h │ ― ( ) ― ↘ 8-9 km/h │ ― ( ) ― → 10-17 km/h │ ― ( ) ― ↘ 13-23 km/h │
│ `-’ 10 km │ `-’ 10 km │ `-’ 10 km │ `-’ 10 km │
│ / \ 0.0 mm | 0% │ / \ 0.0 mm | 0% │ / \ 0.0 mm | 0% │ / \ 0.0 mm | 0% │
└──────────────────────────────┴──────────────────────────────┴──────────────────────────────┴──────────────────────────────┘
┌─────────────┐
┌──────────────────────────────┬───────────────────────┤ Sat 24 Sep ├───────────────────────┬──────────────────────────────┐
│ Morning │ Noon └──────┬──────┘ Evening │ Night │
├──────────────────────────────┼──────────────────────────────┼──────────────────────────────┼──────────────────────────────┤
│ \ / Partly cloudy │ \ / Partly cloudy │ \ / Partly cloudy │ \ / Partly cloudy │
│ _ /"".-. +23(25) °C │ _ /"".-. +27(28) °C │ _ /"".-. +24(26) °C │ _ /"".-. 21 °C │
│ \_( ). ↓ 14-17 km/h │ \_( ). ↘ 14-16 km/h │ \_( ). ↓ 19-24 km/h │ \_( ). ↓ 12-21 km/h │
│ /(___(__) 10 km │ /(___(__) 10 km │ /(___(__) 10 km │ /(___(__) 10 km │
│ 0.0 mm | 0% │ 0.0 mm | 0% │ 0.0 mm | 0% │ 0.0 mm | 0% │
└──────────────────────────────┴──────────────────────────────┴──────────────────────────────┴──────────────────────────────┘
Location: 熊本県, 日本 [32.6450475,130.6341345]
Follow @igor_chubin for wttr.in updates
brewのインストール https://brew.sh/index_ja
- ターミナル上でそのコマンドを実行
/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
- インストールは自動で進みますが、途中でパスワードを聞かれます。ここではMacにログインする際のパスワードを入力しましょう。
brew install curl
C:\WINDOWS\system32\curl.exe
https://pubchem.ncbi.nlm.nih.gov/rest/pug | /input | /operation | /output | /operation_options |
---|---|---|---|---|
https://pubchem.ncbi.nlm.nih.gov/rest/pug | /compound/cid | /2244 | /SDF | ?record_type=3d |
Operation Format |
---|
2244 |
1,2,3,4,5 |
+Operation Format |
---|
MolecularFormula |
MolecularWeight |
CanonicalSMILES |
IsomericSMILES |
InChI |
InChIKey |
IUPACName |
XLogP |
ExactMass |
MonoisotopicMass |
TPSA |
Complexity |
Charge |
HBondDonorCount |
HBondAcceptorCount |
RotatableBondCount |
HeavyAtomCount |
IsotopeAtomCount |
AtomStereoCount |
DefinedAtomStereoCount |
UndefinedAtomStereoCount |
BondStereoCount |
DefinedBondStereoCount |
UndefinedBondStereoCount |
CovalentUnitCount |
Volume3D |
XStericQuadrupole3D |
YStericQuadrupole3D |
ZStericQuadrupole3D |
FeatureCount3D |
FeatureAcceptorCount3D |
FeatureDonorCount3D |
FeatureAnionCount3D |
FeatureCationCount3D |
FeatureRingCount3D |
FeatureHydrophobeCount3D |
ConformerModelRMSD3D |
EffectiveRotorCount3D |
ConformerCount3D |
/property/InChI,XLogP,TPSA
Output Format | Description |
---|---|
XML | standard XML, for which a schema is available |
JSON | JSON, JavaScript Object Notation |
JSONP | JSONP, like JSON but wrapped in a callback function |
ASNB | standard binary ASN.1, NCBI’s native format in many cases |
ASNT | NCBI’s human-readable text flavor of ASN.1 |
SDF | chemical structure data |
CSV | comma-separated values, spreadsheet compatible |
PNG | standard PNG image data |
TXT | plain text |
operation_options |
---|
?name_type=word |
?identity_type=same_connectivity |
?StripHydrogen=true |
?Threshold=99 |
?sid=104169547,109967232 |
?list_return=listkey |
?sid=listkey&listkey=12345678910&listkey_start=0&listkey_count=1000 |
?&listkey_start=0&listkey_count=10 |
?sids_type=active |
?sids_type=standardized&list_return=listkey |
?cids_type=same_parent |
?sids_type=standardized&list_return=flat |
?sids_type=standardized&list_return=grouped |
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/substance/sourceid/DTP.NCI/747285/SDF
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/2244/SDF
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/2244/SDF?record_type=3d
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/name/aspirin/SDF
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/name/glucose/SDF
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/smiles/CCCC/cids/TXT
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/smiles/CC\(=O\)CC/cids/TXT
-
C(=O)CC
>C\(=O\)CC
-
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/inchikey/BPGDAMSIGCZZLK-UHFFFAOYSA-N/SDF
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/substance/sourceid/DTP.NCI/747285/PNG
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/2244/PNG
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/2244/PNG?record_type=3d&image_size=small
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/name/aspirin/synonyms/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/smiles/CCCC/synonyms/XML
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/substance/sid/53789435/synonyms/TXT
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/1983/description/XML
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/1983/description/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/substance/name/glucose/sids/XML
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/substance/name/glucose/sids/XML?list_return=listkey
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/substance/name/glucose/cids/XML?list_return=grouped
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/substance/name/glucose/cids/XML?list_return=flat
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/substance/sourceall/MLSMR/sids/JSON?list_return=listkey
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/substance/sid/123061,123079/cids/XML?cids_type=all
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/2244/sids/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/192180/cids/TXT?cids_type=component
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/2244/aids/JSON?aids_type=active
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/2244/sids/JSON?sids_type=component
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/2244/cids/TXT?cids_type=same_connectivity
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/21145249/cids/XML?cids_type=parent
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/aid/1000/sids/XML?sids_type=inactive
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/aid/504526/sids/JSON?sids_type=doseresponse
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/type/doseresponse/aids/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/sourceall/DTP.NCI/aids/XML
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/substance/xref/PatentID/EP0711162A1/sids/XML
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/name/myxalamid/cids/XML?name_type=word
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/name/myxalamid/cids/XML?name_type=complete
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/target/genesymbol/USP2/aids/TXT
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/target/gi/116516899/aids/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/activity/EC50/aids/TXT
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/aid/1000/XML
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/aid/504526/CSV
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/aid/1000/CSV?sid=26736081,26736082,26736083
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/aid/1000/concise/CSV
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/aid/504526/concise/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/aid/490/description/XML
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/aid/490/description/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/1000,1001/assaysummary/CSV
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/substance/sid/104234342/assaysummary/XML
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/substance/sid/104234342/assaysummary/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/aid/1000/summary/XML
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/aid/1000/summary/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/aid/504526/doseresponse/XML
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/aid/504526/doseresponse/CSV?sid=104169547,109967232
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/geneid/1956,13649/summary/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/geneid/13649/concise/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/genesymbol/EGFR/summary/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/genesymbol/EGFR/10090/summary/JSON
- (mouse Egfr gene by NCBI Taxonomy ID 10090)
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/genesymbol/EGFR/Mus%20musculus/summary/JSON
- (mouse Egfr gene by scientific taxonomy name)
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/genesymbol/EGFR/house%20mouse/summary/JSON
- (mouse Egfr gene by common taxonomy name)
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/genesymbol/EGFR/mouse/summary/JSON
- (mouse Egfr gene by taxonomy synonym)
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/synonym/ERBB1/summary/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/synonym/HGNC:3236/summary/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/synonym/Ensemble:ENSG00000146648/summary/JSON
- (with ID source, recommended)
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/synonym/ENSG00000146648/summary/JSON
- (without ID source)
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/geneid/1956,13649/summary/JSON
- (by Gene ID)
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/genesymbol/EGFR/summary/XML
- (by gene symbol, case insensitive and default to human)
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/genesymbol/EGFR/10090/summary/JSON
- (mouse with NCBI TaxonomyID 9606)
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/genesymbol/EGFR/Rattus%20norvegicus/summary/JSON
- (mouse with scientific taxonomy name)
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/genesymbol/EGFR/Norway%20rat/summary/JSON
- (mouse with common taxonomy name)
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/synonym/EGFR/summary/JSON
- (by synonym, note that one synonym may map to multiple GeneIDs)
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/geneid/13649/pwaccs/TXT
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/geneid/13649/pwaccs/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/gene/geneid/13649/pwaccs/XML
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/protein/accession/P00533,P01422/summary/JSON
- (single accession or a list of comma-separated accessions)
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/protein/synonym/PR:P00533/summary/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/protein/synonym/ChEMBL:CHEMBL203/summary/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/protein/accession/P00533,P01422/summary/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/protein/accession/P00533,P01422/summary/TXT
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/protein/accession/P00533/aids/TXT
- (limited to one protein only)
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/protein/accession/Q01279/concise/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/protein/accession/Q01279/aids/TXT
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/aid/66438/concise/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/aid/69721/concise/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/pathway/pwacc/Reactome:R-HSA-70171/summary/JSON
- (single accession)
-
- (a list of comma-separated accessions)
- https://pubchem.ncbi.nlm.nih.gov/rest/pug/pathway/pwacc/Reactome:R-HSA-70171/cids/TXT
- https://pubchem.ncbi.nlm.nih.gov/rest/pug/pathway/pwacc/Reactome:R-HSA-70171/cids/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/pathway/pwacc/Reactome:R-HSA-70171/cids/XML
- (limited to one pathway only)
- https://pubchem.ncbi.nlm.nih.gov/rest/pug/pathway/pwacc/Reactome:R-HSA-70171/geneids/TXT
- https://pubchem.ncbi.nlm.nih.gov/rest/pug/pathway/pwacc/Reactome:R-HSA-70171/geneids/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/pathway/pwacc/Reactome:R-HSA-70171/geneids/XML
- (limited to one pathway only)
- https://pubchem.ncbi.nlm.nih.gov/rest/pug/pathway/pwacc/Reactome:R-HSA-70171/accessions/TXT
- https://pubchem.ncbi.nlm.nih.gov/rest/pug/pathway/pwacc/Reactome:R-HSA-70171/accessions/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/pathway/pwacc/Reactome:R-HSA-70171/accessions/XML
- (limited to one pathway only)
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/taxonomy/taxid/9606,2697049/summary/JSON
- (one ID or a list of comma-separated IDs)
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/taxonomy/synonym/Homo%20sapiens/summary/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/taxonomy/synonym/human/summary/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/taxonomy/synonym/SARS-COV-2/summary/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/taxonomy/synonym/ITIS:180092/summary/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/taxonomy/taxid/2697049/aids/TXT
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/taxonomy/taxid/2697049/aids/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/taxonomy/taxid/2697049/aids/XML
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/cell/synonym/HeLa/summary/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/cell/synonym/MeSH:D006367/summary/JSON
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/cell/cellacc/CVCL_0030,CVCL_0045/summary/JSON
- (by Cellosaurus cell line accession)
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/cell/synonym/HeLa/summary/JSON
- (by synonym)
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/cell/synonym/HeLa/aids/TXT
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/cell/synonym/HeLa/aids/JSON
curl -L https://pubchem.ncbi.nlm.nih.gov/rest/pug/substance/name/aspirin/cids/TXT?list_return=grouped > ./aspirin.cids.txt
...
2244
2244
3434975
3434975
12280114
12280114
12280114
24847961
24847962
24847963
145904
...
curl -L https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/2244/aids/TXT?aids_type=active > ./cid2244aids.txt
...
321895
321896
323716
323717
328210
338333
338335
...
curl -L https://pubchem.ncbi.nlm.nih.gov/rest/pug/assay/aid/182665/CSV > ./aid182665.csv
PUBCHEM_RESULT_TAG,PUBCHEM_SID,PUBCHEM_CID,PUBCHEM_EXT_DATASOURCE_SMILES,PUBCHEM_ACTIVITY_OUTCOME,PUBCHEM_ACTIVITY_SCORE,PUBCHEM_ACTIVITY_URL,PUBCHEM_ASSAYDATA_COMMENT,Standard Type,Standard Units,Activity Comment
RESULT_TYPE,,,,,,,,STRING,STRING,STRING
RESULT_DESCR,,,,,,,,Standardized activity type (e.g. IC50 rather than Ic-50/Ic50/ic50/ic-50),Selected units for 'Standard Type': e.g. concentrations are in nM,Additional comments
1,103164874,2244,CC(=O)OC1=CC=CC=C1C(=O)O,Active,,,Potential missing data,Inhibition,%,Active
2,103183760,13392299,C1=CC(=CC=C1C2=NC(=C(O2)C3=COC=C3)CC(=O)O)Cl,Unspecified,,,Potential missing data,Inhibition,%,
3,103184577,13773145,CCCCC1=NC(=C(O1)C2=CC=CO2)CC(=O)OCC,Unspecified,,,Potential missing data,Inhibition,%,
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/substance/sid/127378063/xrefs/PatentID/XML
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/name/vioxx/xrefs/RegistryID,RN,PubMedID/JSONP
-
部分構造、類似性、様々な部分一致(原子の接続は同じだが立体化学は未指定など)など、より高度な構造検索が利用できます。PUG RESTでは、分子式検索もこのカテゴリーに入ります。
-
これらの検索では、PubChemデータベース全体を検索するのに時間がかかるため、先に述べたPUG RESTリクエストの30秒以内に結果が出ない可能性があります。
-
これを回避するために、PUG RESTは「非同期」操作と呼ばれるものを使用しており、検索を開始するときにジョブチケットのようなものである要求識別子を取得します。
-
そして、検索が終了したかどうかを定期的に(例えば5-10秒ごとに)チェックし、終了していれば結果を取得します。
- RESTリクエストを送信し、要求識別子を取得
{
"Waiting": {
"ListKey": "3734994328758159440",
"Message": "Your request is running"
}
}
- 要求識別子を利用し、結果を取得
-
同一性、類似性、部分構造などの高速化な検索
-
- same_connectivity
-
- substructure
-
- similarity_2d
-
https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/fastsimilarity_3d/cid/2244/cids/JSON
- similarity_3d
-
web service definition language (WSDL)
-
PUG SOAP Client Help
curl -L -H "Accept: text/rdf" -o CID2244.rdf http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID2244
-
https://pubchem.ncbi.nlm.nih.gov/rest/rdf/query?graph=synonym&name=aspirin
-
https://pubchem.ncbi.nlm.nih.gov/rest/rdf/query?graph=synonym&name=aspirin&contain=true
-
https://pubchem.ncbi.nlm.nih.gov/rest/rdf/query?graph=synonym&name=aspirin&return=compound
-
https://pubchem.ncbi.nlm.nih.gov/rest/rdf/query?graph=synonym&name=aspirin&format=json
-
https://pubchem.ncbi.nlm.nih.gov/rest/rdf/query?graph=substance&predicate=rdf:type&offset=10000
-
https://pubchem.ncbi.nlm.nih.gov/rest/rdf/query?graph=synonym&pred=rdf:type&obj=sio:CHEMINF_000561
-
About
をクリック
-
Downlod
[From the PubChem FTP site](https://pubchemdocs.ncbi.nlm.nih.gov/downloads#_3-2)
をクリック
-
例えば、 https://ftp.ncbi.nlm.nih.gov/pubchem/Compound/CURRENT-Full/SDF/ にアクセスすると、下図が表示される。
- 必要に応じて、各種ソフトでデータをダウンロードすることができる。
bash-3.2$ wget https://ftp.ncbi.nlm.nih.gov/pubchem/Compound/CURRENT-Full/SDF/Compound_152500001_153000000.sdf.gz
--2022-09-13 10:07:06-- https://ftp.ncbi.nlm.nih.gov/pubchem/Compound/CURRENT-Full/SDF/Compound_152500001_153000000.sdf.gz
ftp.ncbi.nlm.nih.gov (ftp.ncbi.nlm.nih.gov) をDNSに問いあわせています... 130.14.250.7, 165.112.9.230
ftp.ncbi.nlm.nih.gov (ftp.ncbi.nlm.nih.gov)|130.14.250.7|:443 に接続しています... 接続しました。
HTTP による接続要求を送信しました、応答を待っています... 200 OK
長さ: 74473800 (71M) [application/x-gzip]
`Compound_152500001_153000000.sdf.gz' に保存中
Compound_152500001_153000000.sdf.gz 100%[================================================================================================>] 71.02M 861KB/s 時間 94s
2022-09-13 10:08:42 (770 KB/s) - `Compound_152500001_153000000.sdf.gz' へ保存完了 [74473800/74473800]
bash-3.2$
- ファイルの確認
bash-3.2$ gzcat Compound_152500001_153000000.sdf.gz | less
- ファイルの表示
152500001
-OEChem-08292205122D
107114 0 1 0 0 0 0 0999 V2000
6.0365 -0.8200 0.0000 O 0 0 0 0 0 0 0 0 0 0 0 0
8.5745 -0.3156 0.0000 O 0 0 0 0 0 0 0 0 0 0 0 0
7.3183 -2.5779 0.0000 O 0 0 0 0 0 0 0 0 0 0 0 0
4.0365 -0.8235 0.0000 O 0 0 0 0 0 0 0 0 0 0 0 0
10.2991 -1.4937 0.0000 O 0 0 0 0 0 0 0 0 0 0 0 0
5.0000 4.6130 0.0000 O 0 0 0 0 0 0 0 0 0 0 0 0
13.2688 -1.3586 0.0000 O 0 0 0 0 0 0 0 0 0 0 0 0
14.5549 -2.4362 0.0000 O 0 0 0 0 0 0 0 0 0 0 0 0
7.3122 0.9422 0.0000 N 0 0 0 0 0 0 0 0 0 0 0 0
7.3122 2.5517 0.0000 N 0 0 0 0 0 0 0 0 0 0 0 0
5.5000 0.7470 0.0000 N 0 0 0 0 0 0 0 0 0 0 0 0
4.6340 2.2470 0.0000 N 0 0 0 0 0 0 0 0 0 0 0 0
6.0000 3.6130 0.0000 N 0 0 0 0 0 0 0 0 0 0 0 0
7.0365 -0.8183 0.0000 C 0 0 1 0 0 0 0 0 0 0 0 0
7.6229 -0.0083 0.0000 C 0 0 1 0 0 0 0 0 0 0 0 0
7.6257 -1.6263 0.0000 C 0 0 1 0 0 0 0 0 0 0 0 0
8.5762 -1.3156 0.0000 C 0 0 1 0 0 0 0 0 0 0 0 0
5.5380 -1.6869 0.0000 C 0 0 1 0 0 0 0 0 0 0 0 0
4.5380 -1.6887 0.0000 C 0 0 1 0 0 0 0 0 0 0 0 0
6.0395 -2.5521 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0
4.0395 -2.5556 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0
5.5410 -3.4190 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0
4.5410 -3.4207 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0
9.3863 -1.9020 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0
7.8958 1.7470 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0
6.3660 1.2470 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0
6.3660 2.2470 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0
11.1091 -2.0801 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0
5.5000 2.7470 0.0000 C 0 0 3 0 0 0 0 0 0 0 0 0
:
rsync -Pav ftp.ncbi.nlm.nih.gov::pubchem/Compound/CURRENT-Full/SDF/\*.gz pubchem/