Materials Platform for Data Science (MPDS) Dataset
Materials Platform for Data Science Dataset (Project Pauling Files) is a highly curated inorganic materials dataset with a 30-year track record, based on about half a million peer-reviewed scientific publications and powering several successful commercial products from the world's leading publishers. It integrates materials data from 405,100 publications, linking 139,005 phase diagrams, 409,771 crystalline nanostructures, and 1,075,676 physical property sets into 189,682 materials phases within a large-scale materials graph of ~5 million nodes and ~150 million edges, making it a comprehensive materials platform for data science and a cornerstone among modern material science datasets and materials project database resources.
-
- Publications
- 405,100
-
- Phase diagrams
- 139,005
-
- Crystalline nanostructures
- 409,771
-
- Property sets
- 1,075,676
-
- Material phases
- 189,682
- Materials Science
- Chemistry
- Physics
- Data Science
- Machine Learning
Materials Platform for Data Science Dataset (Project Pauling Files) is a highly curated inorganic materials dataset with a 30-year track record, based on about half a million peer-reviewed scientific publications and powering several successful commercial products from the world's leading publishers. It integrates materials data from 405,100 publications, linking 139,005 phase diagrams, 409,771 crystalline nanostructures, and 1,075,676 physical property sets into 189,682 materials phases within a large-scale materials graph of ~5 million nodes and ~150 million edges, making it a comprehensive materials platform for data science and a cornerstone among modern material science datasets and materials project database resources.
- Materials Science
- Chemistry
- Physics
- Data Science
- Machine Learning
-
- Publications
- 405,100
-
- Phase diagrams
- 139,005
-
- Crystalline nanostructures
- 409,771
-
- Property sets
- 1,075,676
-
- Material phases
- 189,682
Dataset Info
| Characteristic | Data |
| Description | Inorganic materials with a 30-year history, powering academic research and commercial applications |
| Data types | Relational SQL database, JSON export |
| Tasks | Academic and industrial R&D, materials discovery, ML modeling |
| Labeling | 100+ categories: elements, formulas, properties, symmetry, etc. |
| Language | Controlled scientific English |
Technical
Characteristics
| Characteristic | Data |
| Format | JSON, schema |
| Searchable fields | Physical properties, chemical elements, material classes, crystal system, formula, space group, etc. |
Dataset Use Cases
FAQs
Unidata Cases
Similar Datasets
Why Companies Trust Unidata's Datasets
Share your project requirements, we handle the rest. Every service is tailored, executed, and compliance-ready, so you can focus on strategy and growth, not operations.
What our clients are saying
UniData
Our Clients Love Us
Ready to get started?
Tell us what you need — we’ll reply within 24h with a free estimate
- Andrew
- Head of Client Success
— I'll guide you through every step, from your first
message to full project delivery
Thank you for your
message
We use cookies to enhance your experience, personalize content, ads, and analyze traffic. By clicking 'Accept All', you agree to our Cookie Policy.

