Successful Research Project: the EDAG Group Pipeline "AIdentify"

Artificial intelligence redefines the evaluation and writing of short technical texts in automotive development 

After two years' research, the EDAG Group has completed the project "artificial intelligence for the semantic analysis of short technical texts", or "AIdentify". The focus of the project is on automated text analysis and editing using artificial intelligence, because it is complex, technical texts which form the basis of automotive development. Natural Language Processing, or NLP, is at the heart of such analyses. The project was carried out by the largest independent developer of mobility technology in cooperation with denkbares GmbH, a digital transformation think tank. Funded by the Bavarian State Ministry for Economic Affairs, Regional Development and Energy within the framework of the Bavarian Joint Research Program (BayVFP), the interdisciplinary team investigated the use of AI in the evaluation and application of short technical texts. The aim is to be able to detect inconsistencies in vehicle development, trends in the automotive industry and country-specific defects in vehicles more quickly and easily with the aid of NLP text analysis.

"For the EDAG Group, machine text analysis and generation is another milestone in the establishment and expansion of a future-oriented ecosystem for the mobility of tomorrow," says Cosimo De Carlo, director and CEO of the EDAG Group. "With the EDAG Pipeline AIdentify, we now have a promising prototype for the structured and largely automated transfer of knowledge and information that will significantly advance technology development in the automotive industry."

In the research project, EDAG's software development team led by Jacek Burger, Head of Embedded Systems & Computer Vision/AI, looked into whether, and if so how, NLP in particular can help us to deal with the rapid increase in the number of short texts. Especially in the automotive industry, large numbers of such short texts are generated in test bench reports or activity reports written by service technicians, for instance, or in customer complaints. These need to be automatically evaluated and processed in a database-based ticket system with the aid of AI.

"The difference between short technical texts and prose is that the former are written by numerous authors, all with different background knowledge. They often contain spelling mistakes, codes, abbreviations, multilingual terms and colloquialisms. This is where standard NLP concepts come up against their limits," says Nathalie Klingler, one of the EDAG Group's software engineers and a specialist in the field of explainable artificial intelligence, who, along with Jochen Nüßle, an EDAG Group software engineer in Lindau, was responsible for supervising the AIdentify project. 

To the best of EDAG's AI team's knowledge, databases of this type have so far only been used for filing, and not as a source of knowledge. But this is about to change. "AIdentify permits the output of semantically similar texts based on an initial text. The AI evaluates the concepts, then derives from this recommendations for handling short technical texts. This enables employees to access solutions to similar problems, providing them with useful support in their work with short technical texts," says Jochen Nüßle.

The Pipeline AIdentify developed by the EDAG Group has reached a good, functional level now that the two-year research project has been completed. Already, it makes the extraction of similar texts from a database and their semantic processing possible. 

"The pipeline already handles several applications in a way that is both useful and reliable," explains Jacek Burger. "It improves text quality, consistency checks in tickets, knowledge extraction, and also clustering.“ The aim now is to carry out additional evaluations and practical tests on the software, so as to develop a robust and modular toolbox which is ready for the market. It should be possible use it for additional applications and databases without too many adjustments.