30 credits – Natural language processing on technical documents
Thesis project at Scania is an excellent way of making contacts for your future working life. Many of our current employees started their career with a thesis project. This thesis is within the data science team at Scania IT and you will be working within a field of great strategic value for Scania.
There are a lot of technical documents and complaints arriving to Scania on a regular basis from workshops. These short texts are usually written in haste and therefore they are sometimes filled with mistakes to a degree where the language is difficult to evaluate by machine. Due to many technical terms in the texts ordinary spelling correction will fail.
Purpose of the thesis project is to find a way of doing spelling correction that will work well with the technical texts.
Creating a spelling algorithm that will work in the context of the existing environment.
Specify education or specialisation: master student in IT or statistics, data science or similar.
Knowledge in the following subjects would be beneficial: Big data, Hadoop and related technologies, data mining, machine learning, natural language processing, statistics and programming.
Number of students: 1-2
Start date: January 2019
Estimated time needed: 20 weeks
Contact persons and supervisors:
Isolde Snellman, IXAD, 08-553 71 117
Annette Hultåker, IXAD, 08-553 82 097
Your application should contain a covering letter, CV and transcripts.
Selections will be made throughout the application period.
Publication date from - until
2018-08-24 – 2018-12-02