Chem: Dimensionality Reduction & Clustering

3 min read · Oct 1, 2020

When a dataset has many dimensions, it is hard to visualize it or to recognize patterns in it. Keeping dimensionality low is a rule of thumb in chemoinformatics, but it is not easy to maintain because we need many descriptors to capture the relevant information. It is also hard to build a low-dimensional representation from the beginning, since we cannot know in advance which descriptor vectors are independent and which are correlated. Therefore, the preferred approach is to reduce the dimensionality afterwards.
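To make the point about correlated descriptors concrete, here is a minimal sketch that checks pairwise Pearson correlation between descriptor columns. The descriptor names (`mol_weight`, `heavy_atoms`, `toy_polarity`), their values, and the 0.9 threshold are all made up for illustration; they are not real measurements or a standard cutoff.

```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical toy descriptor table: one list per descriptor,
# one entry per molecule. The values are invented for illustration.
descriptors = {
    "mol_weight": [46.0, 60.0, 74.0, 88.0, 102.0],
    "heavy_atoms": [3, 4, 5, 6, 7],
    "toy_polarity": [0.9, 0.2, 0.7, 0.1, 0.5],
}

# Assumed cutoff: treat |r| above this value as "carries the same information".
THRESHOLD = 0.9

# Collect descriptor pairs whose absolute correlation exceeds the cutoff.
redundant_pairs = [
    (a, b)
    for a, b in combinations(descriptors, 2)
    if abs(pearson(descriptors[a], descriptors[b])) > THRESHOLD
]

print(redundant_pairs)
```

In this toy table the first two descriptors move together, so one of them adds almost no new information. With real data such overlaps are rarely obvious before the descriptors are computed, which is why the reduction is applied afterwards.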

Dimensionality Reduction




Jeheon Park, Software Engineer at Kakao in South Korea