my name is Minyang, welcome to my github repositories. I am software enginner living in Greater Toronto Area, Canada.
I architected, designed and implemented business critical hub placement, billing and financial solution at world largest commercial insurance broker Marsh Canada (parent: MMC). As tech lead for one of the Canada largest Bank BMO to complete the banking systems modernization with real-time lead management and offering capabilities. At IBM Canada - Watson health tackling big data challenges in Health Care. developing secure privacy first platform, Cloud cognitive services and big data normalization pipelines, data protection and de-identification framework for analytics, data linking and AI training use cases.
I found my passion for AI, big data, cloud and mental model concepts many years ago. Since then I never stopped learning. Currently, I am focusing myself in the area Generative AI/Large Language Model (LLM) fine-tuning and knowledge enrichment with private data integration and how to leverage fine-tuned models to generate business value and assist problem-solving for an enterprise.
- Email: https://mychen76@gmail.com
- Medium: https://medium.com/@mychen76
- Huggingface: https://huggingface.co/mychen76
- Github: https://github.com/minyang-chen
- AI.Researches: https://github.com/PavAI-Research
- Dev.Lab: https://github.com/McLabGalaxy
- linkedin: https://www.linkedin.com/in/minyang-chen
I believe that one should not have a fixed technology stack, but should always respond to the needs and problems of the customer and business needs. Below is just an excerpt and my favorite technologies in machine learning, cloud and non-ML frameworks.
🤖 Generative AI/ LLM
OpenAI ChatGPT, LLama2, Alpaca, H2OGPT
🤖 Machine Learning
LangChain,LlamIndex, Huggingface Transformers, Pytorch, Scikit-Learn, Tensorflow, Weights & Bias, Optuna, Pandas, Numpy
☁️ Cloud
AWS, GCP, Azure, Kubernetes, Kubeflow, Docker, Terraform, AWS CDK, Github Actions, Serverless Framework
☁️ Big Data
DuckDB, Apache Spark,Apache Iceberg, Cassandra, Hadoop,Delta Lake and Talend
☁️ Data Privacy
Microsoft presidio, postgresql anonymizer
🏗️ Frameworks
Poetry, Angular, Spring, Rust, React, Svelte, React-Native, LitElement, GraphQL, Gatsby, TailwindCSS
🏗️ Architectures
Clean, Event Driven, SOA, Distributed and decentralized, Polylith
Here are some ideas to write about:
- 🔭 I’m currently working on ... [AI researches, LLM model Merging, LLM fine-tuning and private knowledge integration]
- 🌱 I’m currently learning ...
- 👯 I’m looking to collaborate on ...
- 🤔 I’m looking for help with ...
- 💬 Ask me about ...
- 📫 How to reach me: ...
- 😄 Pronouns: ...
- ⚡ Fun fact: ...