An AI-powered system for processing old construction drawings. Rotation of scanned documents, extraction of key data fields.
Our client operates a system that integrates customers with large volumes of accumulated construction drawings, some of which are very old, scanned paper documents. Before these drawings can be uploaded into the system, they need to be indexed by extracting key information from the title block, such as the drawing number, scale, revision, and other parameters. As this task is highly repetitive and time-consuming, it has been outsourced to manual labor, but this approach is slow and costly. The client sought a faster, more efficient solution using machine learning and AI.
The resulting system is an AI-powered solution that automates the extraction of key data fields from construction drawings and corrects their orientation for seamless integration into the client’s system. It combines Azure Document Intelligence for data extraction with a two-step image orientation correction process using Tesseract and PaddleOCR.
Azure Document Intelligence forms the backbone of the system. We trained a custom model using sample drawings provided by the client, teaching it to recognize and extract six key fields: title, type, drawing number, sheet number, scale, and revision.
Unlike generic OCR tools, Azure’s ability to learn from labeled data ensures high accuracy, even with the variability found in old, scanned drawings. This eliminates the need for manual intervention and significantly speeds up the indexing process.
To ensure all drawings are correctly aligned before data extraction, the system employs a two-step orientation correction process:
The system was designed with cost efficiency in mind. By automating the data extraction and orientation correction processes, it eliminates the need for expensive manual labor. Budget estimates show that the automated solution is already more cost-effective than outsourcing, while also delivering faster and more consistent results.
The results exceeded expectations. By training the Azure AI model, we achieved highly accurate data extraction for the six target fields. The automated solution not only outperformed manual labor in terms of speed and cost but also introduced an additional benefit: automatic orientation correction, a task that was previously ignored in manual processes.
We estimated the budget for this automation and found it to be 3 times less expensive than outsourcing to manual labor. Additionally, the inclusion of orientation correction adds significant value, as it ensures all documents are uniformly aligned within the system.