The aim is to fix and optimize a Python analysis script, consolidate many distributed JSON files into a fast-queryable database (SQL or Parquet), generate graph-rich reports, and embed the resulting API into an existing Qt/Python application.
The assignment
Fix a Python script to analyze production data (issue with loaction of the measured dies within a wafer)
Design a structured database to store and retrieve large datasets ( potentially using SQL, Parquet files) efficiently.
Create a graphical report showcasing key insights from the production data.
Integrate the data analysis and reporting functionality into an existing Qt-based software via API.
Ensure fast data retrieval and querying capabilities for large datasets.
Required knowledge and skills
Advanced Python,
Data mining,
SQL,
Machine Leaning,
Familiar with JSL is a plus.