Learn Challenge: Cluster a Compound Library | Similarity, Clustering and Drug Discovery
Python for Chemoinformatics

Challenge: Cluster a Compound Library

Task

Write a Python function that uses RDKit to group a list of SMILES strings into clusters by pairwise Tanimoto similarity. Within a cluster, every molecule must have a Tanimoto similarity above 0.6 to at least one other member of that cluster.

  • Parse each SMILES string into an RDKit molecule.
  • Generate Morgan fingerprints for each molecule.
  • Compare fingerprints pairwise using Tanimoto similarity.
  • Group molecules so that every member of a cluster has a similarity above 0.6 to at least one other member of that cluster.
  • Return a list of clusters, where each cluster is a list of SMILES strings.
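
The steps above could be sketched as follows. This is one possible approach, not the course's reference solution. It treats every pair with similarity above the threshold as a link and returns the connected components of those links (single-linkage grouping). Several details are assumptions the task does not specify: the Morgan radius of 2 and the 2048-bit fingerprint size, raising ValueError on an unparsable SMILES string, and placing a molecule with no neighbor above the threshold in a cluster of its own. The names connected_components and cluster_smiles are illustrative.

```python
from itertools import combinations


def connected_components(n, linked_pairs):
    """Group indices 0..n-1 into components, given pairs of linked indices."""
    parent = list(range(n))

    def find(i):
        # Follow parent pointers to the root, shortening the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for a, b in linked_pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    return list(groups.values())


def cluster_smiles(smiles_list, threshold=0.6):
    """Cluster SMILES strings whose Morgan fingerprints are linked by Tanimoto > threshold."""
    # Imported here so connected_components stays usable without RDKit installed.
    from rdkit import Chem, DataStructs
    from rdkit.Chem import rdFingerprintGenerator

    # Assumption: radius 2 and 2048 bits; the task does not fix these values.
    generator = rdFingerprintGenerator.GetMorganGenerator(radius=2, fpSize=2048)

    fingerprints = []
    for smiles in smiles_list:
        mol = Chem.MolFromSmiles(smiles)
        if mol is None:
            # Assumption: fail loudly rather than silently dropping the input.
            raise ValueError(f"Could not parse SMILES: {smiles!r}")
        fingerprints.append(generator.GetFingerprint(mol))

    # Link every pair whose similarity is strictly above the threshold.
    linked = [
        (i, j)
        for i, j in combinations(range(len(fingerprints)), 2)
        if DataStructs.TanimotoSimilarity(fingerprints[i], fingerprints[j]) > threshold
    ]

    components = connected_components(len(smiles_list), linked)
    return [[smiles_list[i] for i in group] for group in components]
```

Because clusters are built from links rather than from all-pairs similarity, two molecules in the same cluster may themselves be less than 0.6 similar, as long as a chain of similar neighbors connects them. The all-pairs comparison is quadratic in the number of molecules, which is fine for a small library; RDKit's DataStructs.BulkTanimotoSimilarity can reduce the Python-level overhead for larger ones.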

Section 2. Chapter 4