This job ad has been posted over 40 days ago! (*)
Nexedi is looking for a 6 months trainee to develop an application to allow to quickly and intelligently search through a large codebase hosted on github/gitlab such as the over 10 million lines of code maintained by Nexedi. The minimum outcome should be a codecrawler replicating the functionality of our old gitweb repository with a number of additional features.
In addition to this we are curious to know whether it is possible to use Natural Language Processing Models on our codebase and what the results may be (we’ll coin this “Artificial Language Processing”). We want to know whether it is possible to find some sort of structure in a codebase or some sort of grammar. Identify properties, categories and data types instead of people, locations and organisations? The exact scope of this part of the traineeship to be defined.