Learn Challenge: Find Similar Drug-like Molecules | Similarity, Clustering and Drug Discovery
Python for Chemoinformatics

Challenge: Find Similar Drug-like Molecules

Task

Write a function to identify molecules from a list of candidate SMILES strings that are similar to a given reference SMILES, using Tanimoto similarity.

  • Parse the reference_smiles string into an RDKit molecule and generate its Morgan fingerprint with a radius of 2.
  • For each SMILES in candidate_smiles_list, parse it into an RDKit molecule and generate its Morgan fingerprint with a radius of 2.
  • Compute the Tanimoto similarity between the reference fingerprint and each candidate fingerprint.
  • Return a list of SMILES strings for those candidates with similarity strictly greater than 0.7.
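
The steps above can be sketched as follows. This is a minimal illustration, not the official solution: the function name find_similar_molecules, the threshold parameter, and the fingerprint size of 2048 bits are assumptions, since the task only fixes the radius (2) and the cutoff (0.7). The task also does not say what to do with SMILES that fail to parse; this sketch skips invalid candidates and raises an error for an invalid reference. For bit-vector fingerprints, Tanimoto similarity is the number of bits set in both fingerprints divided by the number of bits set in either.

```python
from rdkit import Chem, DataStructs
from rdkit.Chem import rdFingerprintGenerator


def find_similar_molecules(reference_smiles, candidate_smiles_list, threshold=0.7):
    """Return candidate SMILES whose Tanimoto similarity to the reference
    is strictly greater than `threshold`.

    The function name and the `threshold` parameter are hypothetical;
    the task only specifies radius 2 and a 0.7 cutoff.
    """
    # Morgan fingerprint generator with radius 2.
    # fpSize=2048 is an assumption (the task does not state a bit length).
    generator = rdFingerprintGenerator.GetMorganGenerator(radius=2, fpSize=2048)

    # Chem.MolFromSmiles returns None when the SMILES cannot be parsed.
    reference_mol = Chem.MolFromSmiles(reference_smiles)
    if reference_mol is None:
        # Assumption: an unparsable reference is treated as an error.
        raise ValueError(f"Could not parse reference SMILES: {reference_smiles!r}")
    reference_fp = generator.GetFingerprint(reference_mol)

    similar = []
    for smiles in candidate_smiles_list:
        mol = Chem.MolFromSmiles(smiles)
        if mol is None:
            # Assumption: unparsable candidates are skipped silently.
            continue
        candidate_fp = generator.GetFingerprint(mol)
        similarity = DataStructs.TanimotoSimilarity(reference_fp, candidate_fp)
        if similarity > threshold:  # strictly greater, as the task requires
            similar.append(smiles)
    return similar
```

The function returns the original candidate strings in input order, so a candidate written differently from the reference (for example, a non-canonical SMILES of the same molecule) is returned exactly as it was supplied.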

Before running this code or the tests, install the RDKit library in your environment. If you control the environment, run 'conda install -c conda-forge rdkit' or 'pip install rdkit'. If you do not, contact platform support or check the platform's documentation for the available packages.

Section 2. Chapter 2