Omar Kamal Hosney

Data Scientist / Machine Learning Software Engineer / Individual Contributor

Email: omkamal@gmail.com

Citizenship: US Citizen (willing to relocate)

Twitter (@omkamal) | LinkedIn | GitHub

Summary

With over 30 years of experience in multiple industries and 9 years in machine learning, data science, and analytics, I am a seasoned data scientist skilled in machine learning and Python programming. I have contributed to renowned companies such as CaaStle, Tribal Credit, Reveel, TA Telecom, Mentor Graphics, IBM, HP, and Lucent Technologies. My career has spanned AI-driven underwriting, algorithm development, and data analytics integration.

Certifications

Black Belt Six Sigma
ASQ Certified Quality Manager

Work Experience

CaaStle – Senior Staff Data Scientist
2023-12 to Present

Leading the development of advanced AI-driven systems and frameworks, with a focus on personalized recommendations and innovative solutions. Demonstrated expertise in implementing cutting-edge AI technologies while maintaining cost-effectiveness and scalability.

  • Developed a sophisticated personalized collection framework utilizing embedding and multi-tower models, achieving optimal balance between effectiveness and cost efficiency
  • Created and implemented comprehensive documentation and design practices that became team standards
  • Built an innovative demo tool for showcasing AI frameworks, which was subsequently adopted by the product team for specification and documentation
  • Successfully led technical presentations to leadership, effectively communicating complex AI concepts and trade-offs
  • Maintained cutting-edge knowledge of AI advancements and implemented them in practical business applications

Technology Stack:

  • Deep Learning: PyTorch, TensorFlow, Neural Networks, Multi-Tower Architecture
  • AI/ML: Generative AI, Large Language Models (LLMs), Vector Embeddings, Semantic Search

Tribal Credit – Chief Data Scientist and Founding Team Member
2019-11-15 to 2023-11

Oversaw fintech data strategy combining AI-driven underwriting and advanced analytics. Developed insights from diverse data sources, constructed scientific data models, and designed statistical and data mining products. Led initiatives to design and execute experiments that optimize data-driven business models.

  • Large Language Model implementation
  • Machine Learning systems development
  • Data Science leadership

Reveel – Chief Data Scientist and Founding Team Member
2015-10-15 to 2019-10-15

Directed data analytics for a stealth startup focused on subscription revenue optimization. Spearheaded development of a patent-pending algorithm for optimal business configuration. Introduced innovative "Pi" metric for improvement areas.

  • Python and NumPy development
  • Algorithm design
  • Statistical analysis

TA Telecom – Chief Data Scientist
2014-11-15 to 2015-10-01

Spearheaded the integration of data analytics and led the formation of a dedicated data science team. Implemented comprehensive data analysis solutions across departments.

  • R programming
  • Statistical Analysis
  • Churn Analysis
  • Tableau visualization

SW Testing Manager at Mentor Graphics
2006-10-01 to 2014-11-01

Led software testing and quality assurance initiatives. Developed testing tools and analytics dashboards.

  • Software Testing
  • Quality Management
  • Six Sigma Implementation

Software Engineer/Software Validation Lead at IBM Egypt
2005-09-01 to 2006-12-01

Developed automotive embedded software and validation systems.

Software Testing Team Lead at QuickTel
2003-06-01 to 2005-09-01

Led the Class 5 Switch Project testing team.

Software Engineer at Lucent Technologies
2000-01-01 to 2002-12-31

Developed software solutions and led Quality Management System deployment.

Business Developer at Yalla Inc.
1999-10-01 to 2000-04-01

Conducted technical market research and business development.

Assistant Engineer at Etisalat
1997-01-01 to 1999-12-31

Worked on GSM mobile networks and developed embedded applications.

Projects

Product Recommender with Contextual Data
2024

Developed advanced neural network models using PyTorch for personalized recommendation systems.

ChatGPT-4 Diagram Generation
2023

Designed prompts for ChatGPT-4 enabling generation of diverse diagrams using Mermaid, D3, and PlantUML.

Child-Centric Conversational AI
2023

Engineered a Streamlit-based chatbot using Langchain and OpenAI's GPT-4.

Cash CoPilot
2023

Integrated ChatGPT with Mexican Tax Authority for real-time financial analysis.

Customer Spending Alert System
2022

Developed statistical process control system for financial monitoring.

Bank Statement Parser
2021

Created parsing tool for Mexican bank statements with advanced security features.

Startup-VC Matching Algorithm
2017

Developed matching system using word embeddings and cosine similarity.

Education

Master of Business Administration

City University of Seattle (1999 - 2001)

Diploma - Computer Networking & Software Programming

Information Technology Institute (1996 - 1997)

Bachelor - Electrical & Electronics Engineering

Cairo University (1991 - 1996)

Publications

Technical Blog on Medium.com

Active technical writer publishing tutorials on AI technologies (2024)

Diagrams as Code: Exploring Mermaid, PlantUML, D2 and Generating Diagrams using AI LLMs

Published on Amazon (2023-09-15)

Patent

Apparatus and method for predicting future incremental churn from a recurring revenue product

US 20160189178 A1

Skills

Large Language Models - Master
OpenAI, Langchain, Anthropic, Google Gemini, CrewAI, ELL, ComfyUI, Flux.Dev, Ollama, Torchtune, Transformers, RAG, ChromaDB
Statistical Analysis - Advanced
Hypothesis Testing, Regression Analysis, Probability Distributions, Non-parametric Statistics, Multivariate Statistics, Time Series Analysis, Experimental Design
Data Science - Master
Associative Analysis, Network Analysis, Natural Language Processing, Analytical Hierarchy Process, A/B Testing, Market Basket Analysis, RFM Analysis
Machine Learning - Master
Regression, Classification, Decision Trees, Random Forests, XGBoost, CatBoost, LightGBM, K-means, DBSCAN, Hierarchical Clustering, PCA, t-SNE, UMAP, Feature Engineering, Ensemble Methods
Python Programming - Master
Python, Pandas, NumPy, Scikit-learn, PyTorch, FastAPI, Seaborn, NetworkX, Arules, Matplotlib
Infrastructure & Cloud - Advanced
AWS Global Infrastructure, AWS Management Console, AWS CLI, EC2, Lambda, RDS, DynamoDB, Aurora, S3, Glacier, SageMaker, Python Boto3 SDK
DevOps and IDE - Intermediate
CI/CD, Docker, Kubernetes, Git, Elastic Stack, Visual Studio Code, Jupyter Notebooks, MLflow, Vim
Data Warehousing - Beginner
ETL Processes, Apache Airflow, Redis, Snowflake, SQL & NoSQL Databases, Data Pipeline Construction, Kafka
Other Programming Languages - Intermediate
R, SQL, Go, C, C++, Perl

Languages

English - Fluent
Arabic - Native Speaker