Contract Conversion & Analytics 

Customer Challenge

To enable data driven decisions, the Air Force required a process to convert raw non-searchable PDF contracts into machine readable structured and unstructured formats to enable actionable data to be extracted, analyzed, and visualized. 

Innovative Solution

ILW created a processing pipeline that automated the extraction of semi-structured form and table data embedded within Air Force contracts and parsed this data into structured tabular outputs. The extracted contract text, forms, and tables are ingested into a NoSQL database, allowing for east search capability for Air Force users. ILW utilizes text mining tools to search these converted contracts for compliance with various regulations. 

Benefits/Outcomes

  • Insight into an almost untapped source of data
  • Converted 3.7 million Air Force contracts into machine readable language
  • Processed 300,000 computational hours
  • Parsed 7 types of PDF forms and tables into structured format

Business Value

New search capability enables enterprise-level understanding on contract compliance, contract health, data rights

Toolbox

  • Data Science, NLP, ML, Text Mining
  • Optical Character Recognition
  • High Performance Computing
  • Open-source Python solution using DoD compatible libraries (Pandas, Tabula, Fitz, Scikit-learn, OpenCV)
  • Tesseract and Couchbase

Related Case Studies You May Like

Text Analytics of PDF Technical Documents

Text Analytics of PDF Technical Documents

Expert Capture Webinars

Expert Capture Webinars

Data Engineering & Data Science

Data Engineering & Data Science

Paint Hangar IoT Monitoring

Paint Hangar IoT Monitoring

Pre-Flight Inspection AR Application

Pre-Flight Inspection AR Application

Expert Capture Maintenance Training Pilot

Expert Capture Maintenance Training Pilot

Navy Automated Data Cleansing with ML

Navy Automated Data Cleansing with ML

Automated Data Capture and Prediction

Automated Data Capture and Prediction

Automated Data Crosswalks

Automated Data Crosswalks

Contract Conversion & Analytics

Contract Conversion & Analytics

Database Tuning & Optimization

Database Tuning & Optimization

Augmented Reality Tools to Increase Workforce Productivity Across the Enterprise

Augmented Reality Tools to Increase Workforce Productivity Across the Enterprise

Decision Support for Cyber Hygiene

Decision Support for Cyber Hygiene

Augmented Reality Engineering Collaboration

Augmented Reality Engineering Collaboration

Big Data Ingestion & Cloud Architecture

Big Data Ingestion & Cloud Architecture

Cloud-Native Azure PaaS Architecture

Cloud-Native Azure PaaS Architecture

Azure Data Integration Hub Modernization

Azure Data Integration Hub Modernization

Data Warehousing & Business Intelligence

Data Warehousing & Business Intelligence

App Service Azure Infrastructure

App Service Azure Infrastructure

Big Data Engineering for Improved Analytics

Big Data Engineering for Improved Analytics

Agile Big Data Development

Agile Big Data Development

Cloud-Based Big Data Analytics

Cloud-Based Big Data Analytics

Supply Chain Predictive Analytics

Supply Chain Predictive Analytics

Cost Allocation Rules Engine Modernization

Cost Allocation Rules Engine Modernization

Data Services Cloud Migration Support

Data Services Cloud Migration Support

Predictive Analytics for the Aircraft Digital Thread

Predictive Analytics for the Aircraft Digital Thread

On-Demand Maintenance Analytics

On-Demand Maintenance Analytics

Algorithm Development & Text Analytics

Algorithm Development & Text Analytics

Machine Learning & NLP for Decision Support

Machine Learning & NLP for Decision Support

Sensor Data Analysis for Predictive CBM+

Sensor Data Analysis for Predictive CBM+

Cutting-Edge Responsive Design

Cutting-Edge Responsive Design

Data Cleansing and Migration

Data Cleansing and Migration

Application Modernization

Application Modernization

Large-Scale Data Integration

Large-Scale Data Integration

Data Science Big Data Ingestion

Data Science Big Data Ingestion

Modern Analytics Framework

Modern Analytics Framework

Agile Big Data Analytics Framework

Agile Big Data Analytics Framework

Big Data Hadoop Administration

Big Data Hadoop Administration

Modern Data Ingestion Framework

Modern Data Ingestion Framework

Performance Tuning & Best Practices

Performance Tuning & Best Practices

Engines Forecast Reporting Tool

Engines Forecast Reporting Tool

Augmented Reality Combustion Chamber & Gear Pump Disassembly

Optimization Using Hadoop

Data Quality & Lineage Mapping

Big Data Platform Analytics Outcomes

Modern Analytic Framework

Data Cleansing and Migration

Enterprise Data Warehousing

Value-Driven Analytics

Enterprise Data Exchange

Valuable Insight into Customer Shopping Behaviors

Interested In Working With Us?