Should have good 5 + experience in Python development Should have some experience in implementing data science algorithms like classification, clustering, neural networks(basics) Should know NLP concepts like TfIDF, Word2Vec etc. Should have hands-on experience in AWS services like EMR, Lambda, EC2, SageMaker Should know how to query MongoDB/DocumentDB to extract data Should know web scraping techniques or implement APIs to extract data from sites like Pubmed, ClinicalTrial