Data & Automation Lead
Our client is looking for a data and automation lead to help find, cleanse, ingest andautomate data used in the product and process. This person would use ourexisting and new data sets to model and prototype how it could be used in our data model andproduct, create and test ideas to clean the data, and work with our product and engineeringteam to automate, collect, clean and ingest into the product. This person would work with ourcustomer account team to understand what data our customers need and then find waysincluding using, scrapping, LLMs and other tools to find and aggregate the data. After prototypethe collection and aggregate tool the team would work to determine how we automate thefunctionality. This role is responsible for making sure the data feeding our products features isclean, structured, and reliable.—-Role Description1. Help envision, design, prototype, build and test the data models for use in our client's product and process2. Work to find ways to collect, automate, structure and deliver our data across all aspectsof in our client's, including, but not limited to the Market Landscapes, DocumentsDatabase and the City/Company/Product page suite.3. Work with the product and customer success teams to find efficiencies in collecting,organizing and structuring our existing data set.4. Work to align mapping of new data sets to our existing company, product, andgovernment data hierarchies.5. Work with customer success team to identify new sources and acquisition methods fordata based on customer needs6. Use scraping AI, ML, LLMs and other emerging technologies to create proof of concepts,refine them and then work to incorporate them into the product7. Operationalizing disparate PO Data sources: Python ETL automation usingPandas/regex/SQL to consolidate multi-year PO data, apply deduplication, cross-reference column reseller mappings, and replace complex Excel workflows with scalablepipelines.8. Entity extraction & content ML scoring: Implementing NER, fuzzy matching, andsupervised models trained on labeled data POs to classify match confidence andcontinuously improve accuracy.9. Work with product and engineering teams to find the best and more efficient ways to addthe automation and data to our product10. Use and build technical skills to help manage data throughout its lifecycle working withthe engineering team to implement them into the product.Desired Skills● Proficiency in data mining, wrangling, and cleaning of large-scale datasets● Advanced Excel combined with Python (Pandas/NumPy) a plus● Experience maintaining and deploying ML models (Transformers, Pattern Recognition)and handling model persistence (.pkl)● Knowledge or abilities with APIs (OpenAI, Gemini, Hubspot, Slack),● Experience or interest in using AI, LLM and other emerging technologies Additional Job DetailsThis position is fully remote and open to candidates in Mexico and Latin America.
Apply Now
Apply Now