There is a new KNIME forum. You can still browse and read content from our old forum but if you want to create new posts or join ongoing discussions, please visit our new KNIME forum: https://forum.knime.com

Questions about MarvinSketch & MolConverter Nodes

Member for

3 years 10 months mtest

Hello,

 

I have some SMILES and would like to count several substructures using the RDKit Substructure Counter Node.

Usually I have just compared the SMILES to SMARTS that I already had but now I would like draw my own SMARTS. 

However, I stumbled upon some issues. For example, drawing Benzene in MarvinSketch and converting it to SMARTS format via MolConverter appears to be not working. I have several SMILES that contain Benzene but I just don't get any matches using the RDKit Substructure Counter. By converting the drawn structures to SMILES format instead I do get matches but - in another example - the drawn structure for Aniline (-NH2) will also match Nitrobenzene (-NO2) as if the Hydrogens were simply ignored.

 

Do I have to change any settings? Or is my approach completely wrong perhaps? Any alternatives in this case?

 

Help is much appreciated!

Michael

Comments
Thu, 01/04/2018 - 07:01

Member for

4 years 10 months

evert.homan_scilifelab.se

Hi,

You can generate SMARTS (or Smiles) directly from MarvinSketch by changing the output format (see image). So if nothing else you should be able to skip the MolConvert step. Maybe this improves the searching behavior?

Smiles usually do not contain explicit hydrogens so if you do a substructure search on aniline defined as Smiles

NC1=CC=CC=C1

you will get nitrobenzene as hit. If you want to exclude this you nee to define a more specific query, by specifically attaching 2 hydrogens to the nitrogen:

[H]N([H])C1=CC=CC=C1

Cheers/Evert

Files
Capture.PNG32.34 KB
Mon, 01/15/2018 - 12:23

Member for

3 years 10 months

mtest

Hello,

 

thank you very much!

 

Just one more question: using MarvinSketch I do not actually get the Smiles code but simply the drawn molecule (converted to SMI). Is there any way to change the structure to a Smiles code? Or is there a way to make those hydrogens explicit in MarvinSketch?

 

Cheers

Michael

Wed, 01/17/2018 - 07:44

Member for

4 years 10 months

evert.homan_scilifelab.se

In the results table you can right-click on the Smiles column header and then choose 'String' instead of Smiles as the 'Available Renderer' But this you need to do each time you reopen the results table.

If you always want to see the actual Smiles strings you need to change the global rendering preferences for Marvin Smiles output (see image).

Good luck/Evert

Files
Capture.PNG75.37 KB
Thu, 01/18/2018 - 10:22

Member for

3 years 10 months

mtest

Thanks a lot - this was really helpful!!

 

Cheers

Michael