Show simple item record

dc.creatorRadaković, Jana
dc.creatorBatalović, Katarina
dc.date.accessioned2025-07-17T12:30:01Z
dc.date.available2025-07-17T12:30:01Z
dc.date.copyright2025-07-04
dc.date.issued2025
dc.identifier.urihttps://vinar.vin.bg.ac.rs/handle/123456789/15178
dc.description.abstractThe repository contains materials science, chemistry, and physics-specialized unsupervised trained models. Word embeddings are generated by means of the Word2vec, a natural language processing technique comprised of language model architectures for fast and efficient learning of distributed representations of words. Continuous Skip-gram model architecture with a negative sampling strategy, as implemented in the Gensim library, is employed for model training. The word embeddings consisting of 200 and 300 vectorial components for materials science and 300 vectorial components for chemistry, physics, and mixed domain are here provided.en
dc.language.isoen
dc.rightsmetadata only accesssr
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.sourcefigshare
dc.subjectword2vec modelen
dc.subjectIn-silico materials designen
dc.subjectWord embeddings as autonomous predictorsen
dc.subjectStatic word embeddingsen
dc.subjectWord embeddings variabilityen
dc.subjectMaterials stabilityen
dc.subjectMaterials informaticsen
dc.subjectDigital designen
dc.subjectCheminformaticsen
dc.subjectNatural language processingen
dc.titleDomain-specific Word2vec-based trained models - Chem300, Phys300, MatSci200, MatSci300, and Mixed300en
dc.typemodel
dc.rights.licenseBY
dc.identifier.doi10.6084/m9.figshare.28740122.v1
dc.description.otherThis digital object is hosted on the Figshare server due to its size and is available under the Creative Commons Attribution 4.0 International License.en
dc.type.versionpublishedVersion


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record