NLTK (Natural Language Toolkit) is a powerful library in Python that provides easy-to-use interfaces to over 50 corpora and lexical resources, including WordNet.
BeautifulSoup is a Python library designed to parse HTML and XML documents. It simplifies tasks like iterating and searching HTML elements by attributes.
In 2022, the official Instagram API allow you to access only your own posts and not even public comments and posts on Instagram because of the rising privacy concerns from the users and frequent accusations of data-breach at many big companies including Facebook. This has made it difficult for programmers to crawl Instagram data.