Archive for the ‘Chemical IT’ Category

Quantum chemistry interoperability (library): another step towards FAIR data.

Saturday, January 1st, 2022

To be FAIR, data has to be not only Findable and Accessible, but straightforwardly Interoperable. One of the best examples of interoperability in chemistry comes from the domain of quantum chemistry. This strives to describe a molecule by its electron density distribution, from which many interesting properties can then be computed. The process is split into two parts:

(more…)

First came Molnupiravir – now there is Paxlovid as a SARS-CoV-2 protease inhibitor. An NCI analysis of the ligand.

Saturday, November 13th, 2021

Earlier this year, Molnupiravir hit the headlines as a promising antiviral drug. This is now followed by Paxlovid, which is the first small molecule to be aimed by design at the SAR-CoV-2 protein and which is reported as reducing greatly the risk of hospitalization or death when given within three days of symptoms appearing in high risk patients.

(more…)

A comparison of searches based on metadata records from three (update: five) research repositories.

Tuesday, September 28th, 2021

In the previous blog post, I looked at the metadata records registered with DataCite for some chemical computational modelling files as published in three different repositories. Here I take it one stage further, by looking at how searches of the DataCite metadata store for three particular values of the metadata associated with this dataset compare.

(more…)

A comparison of descriptive metadata across different data repositories.

Tuesday, September 28th, 2021

The number of repositories which accept research data across a wide spectrum of disciplines is on the up. Here I report the results of conducting an experiment in which chemical modelling data was deposited in six such repositories and comparing the richness of the metadata describing the essential properties of the six depositions.

(more…)

HPC Access and Metadata Portal (CHAMP).

Monday, September 13th, 2021

You might have noticed if you have read any of my posts here is that many of them have been accompanied since 2006 by supporting calculations, normally based on density functional theory (DFT) and these calculations are accompanied by a persistent identifier pointer to a data repository publication. I have hitherto not gone into the detail here of the infrastructures required to do this sort of thing, but recently one of the two components has been updated to V2, after being at V1 for some fourteen years[1]  and this provides a timely opportunity to describe the system a little more. 

(more…)

References

  1. M.J. Harvey, N.J. Mason, and H.S. Rzepa, "Digital Data Repositories in Chemistry and Their Integration with Journals and Electronic Notebooks", Journal of Chemical Information and Modeling, vol. 54, pp. 2627-2635, 2014. https://doi.org/10.1021/ci500302p

Octopus publishing: dis-assembling the research article into eight components.

Friday, August 13th, 2021

In 2011, I suggested that the standard monolith that is the conventional scientific article could be broken down into two separate, but interlinked components, being the story or narrative of the article and the data on which the story is based. Later in 2018 the bibliography in the form of open citations were added as a distinct third component.[1] Here I discuss an approach that has taken this even further, breaking the article down into as many as eight components and described as “Octopus publishing” for obvious reasons. These are;

(more…)

References

  1. D. Shotton, "Funders should mandate open citations", Nature, vol. 553, pp. 129-129, 2018. https://doi.org/10.1038/d41586-018-00104-7

Room-temperature superconductivity in a carbonaceous sulfur hydride!

Saturday, October 17th, 2020

The title of this post indicates the exciting prospect that a method of producing a room temperature superconductor has finally been achived[1]. This is only possible at enormous pressures however; >267 gigaPascals (GPa) or 2,635,023 atmospheres.

(more…)

References

  1. E. Snider, N. Dasenbrock-Gammon, R. McBride, M. Debessai, H. Vindana, K. Vencatasamy, K.V. Lawler, A. Salamat, and R.P. Dias, "RETRACTED ARTICLE: Room-temperature superconductivity in a carbonaceous sulfur hydride", Nature, vol. 586, pp. 373-377, 2020. https://doi.org/10.1038/s41586-020-2801-z

Exploiting the power of persistent identifiers (PIDs) for locating all kinds of research object.

Saturday, August 29th, 2020

The folks at DataCite have announced a new research object discovery service which aims to give users a “comprehensive overview of connections between entities in the research landscape”. The portal https://commons.datacite.org acts as the entry point for three basic types of persistent identifiers (PIDs);

(more…)

A cascading tutorial in finding rich NMR data using the Datacite datasearch engine.

Saturday, April 11th, 2020

In the previous post, I introduced three of a new generation of search engines specialising in the discovery of data. Data has some special features which make its properties slightly different from the conceptual (or natural language) searches we are used to performing for general information and so a search engine specifically for data is invariably going to reflect this. At the simplest level, the data search can retain much of the generic simplicity of a regular search, but to exploit the unique features of data, one really does have to move on to an advanced mode. Here, by introducing a set of search definitions that gradually increase in specificity and power, I hope to convey some of the flavour of one way in which this could be done.

(more…)

New generations of globally aggregating search engines – for (chemical) data.

Tuesday, April 7th, 2020

Chemists have long been familiar with search engines that aspire to index a large proportion of the chemical literature. Think for example the old-generation (and commercial) SciFinder (Scholar) and Reaxys or those that arrived in the 1990s in the online era such as the non-commercial Pubchem or ChemSpider (there are more). But you may not be as familiar with the latest generation of global search engines and here I will focus on three relatively new ones that specialise specifically in tracking down data rather than just publications.

(more…)