Building a Code Search Engine: Using NLP to Find Similar Methods Across Libraries

Speaker: Divya Rani

Track: Data Science

Type: Remote Talk

Room: Central Room (Seminar Room 2)

Time: Oct 04 (Fri): 14:45

Duration: 0:45

A developer may decide to implement a library despite the existence of similar libraries, taking into account factors such as computational performance, language or platform dependency, and accuracy. Therefore, there are several libraries with overlapped functionalities.
In this talk audience will learn how to build a code search engine using Natural Language Processing (NLP) techniques. We'll explore how NLP can be applied using Python's Natural Language Toolkit (NLTK) to function documentation and signatures to uncover similarities between methods across different libraries, enabling developers to find the most suitable code for their needs.

The talk assumes some Python knowledge; NLP knowledge is not required as a brief overview will be given during the talk.

URLs


Thinkst Canary
Python Software Foundation SARAO
AWS City of Cape Town
Afrolabs Centre for High Performance Computing
Black Python Devs