Experience
Outlier AI
Artificial Intelligence Training Engineer Jun 2024 - Jun 10, 2024- I worked on a project called Flamingo MultiTurn, whose objective is to test a generative model using prompts. We compared two responses generated by the LLM model and evaluated which was better in several dimensions
- These evaluations allowed us to obtain the best answers and generated new training data that contributed to refining and optimizing the LLM. This experience allowed me to develop advanced skills in language model analysis, data management and collaborative work in a cutting-edge technological environment.
- Language: Python, JavaScript, Java, TypeScript
- Frameworks: Vue.js, ReactJS
LexCom AI
Tech Lead -> Oct. 2023 - Jun. 2024- Led the development of an AI-based product profitability prediction system for an e-commerce platform, plus extra services such as text generators using OpenAi API , the project covers from backend and frontend development to cloud deployment.
- Utilized data balancing techniques such as SMOTE during the training phase to enhance dataset accuracy and improve model performance
- Coordinated the backend and frontend integration, ensuring a smooth and functional user experience
- Managed the implementation of secure authentication with JWT and the configuration of reverse proxies with Nginx Proxy Manager.
- I performed the containerization of the application using Docker, facilitating the deployment and maintenance of the software.
- I deployed the solution on Google Cloud Platform (Compute Engine)
- Development of Artificial Intelligence using advanced machine learning techniques.
- Built with: Python, Typescript, Django REST framework, React, Vite, PostgresSQL, JWT, Docker, Compute Engine GCP, Nginx Proxy Manager
- Note: The project is private on GitHub, but I can provide access for review upon request.
Projects
Conducted fine-tuning of the 'NousResearch/Llama-2-7b-chat-hf' model using the 'mlabonne/guanaco-llama2-1k' dataset to create the customized 'llama-2-7b-miniguanaco' model. Implemented QLoRA with parameters such as a 64-dimension LoRA attention layer and 4-bit precision to optimize the model’s performance while maintaining resource efficiency. Configured and trained the model with advanced hyperparameters including a cosine learning rate schedule, gradient checkpointing, and specific optimizer settings like paged_adamw_32bit. Fine-tuning was performed using Hugging Face Transformers and PyTorch, achieving optimized performance in language modeling tasks.
Fine Tuning - Python - Hugging Face - PyTorch - LLaMA2 - QLoRA
Developed a Convolutional Neural Network (CNN) model for image classification, achieving an accuracy of 85.72% on the test set. Preprocessed image data using techniques such as data augmentation and normalization to improve model generalization. Implemented the model using TensorFlow/Keras and optimized its performance through hyperparameter tuning.
Python - TensorFlow - Convolutional Neural Network - Data Augmentation
Led the development of an AI-based product profitability prediction system for an e-commerce platform, plus extra services such as text generators using OpenAi API, the project covers from backend and frontend development to cloud deployment. Utilized data balancing techniques such as SMOTE during the training phase to enhance dataset accuracy and improve model performance
Python - Random Forest - SMOTE - Scrapy
Developed a machine learning model to predict the likelihood of a stroke using medical data, achieving an accuracy of 91.12% in validation tests. Utilized feature engineering and model selection techniques to optimize model accuracy, including Random Forest and XGBoost. Implemented data balancing techniques such as SMOTE to address class imbalance in the target variable.
Python - Random Forest - XGBOOST - Feature Engineering
Reseach Projects
Topic modeling to find functionalities within a user review of a Google Play Store app using unsupervised algorithms and sentences embeddings with Transformers. This research contributed to an enhanced understanding of user needs and informed app development priorities.
Natural Language Proccessing - Topic Modelling - Python - Transformers - Clustering - Embeddings