CoderJackZhu/README.md

Hi, I'm Jack Zhu 👋

LLM Agent Engineer @ Baidu Inc.

M.S. in Artificial Intelligence, Xidian University

I focus on LLM Agents, Artificial Intelligence, Computer Vision, and Multimodal AI Systems.


About Me

I am an LLM Agent engineer with a background in artificial intelligence and computer vision.

My current interests include:

  • LLM Agent systems and workflow orchestration
  • Tool use, function calling, RAG, and structured generation
  • Multimodal large language models
  • Computer vision, object detection, and pose estimation
  • AIGC, including image and text generation
  • Multimodal medical image segmentation and glioma classification

Tech Stack

LLM Agents       : Agent Workflow, Tool Calling, Function Calling, Dify, LangChain
RAG Systems      : Retrieval, Reranking, Context Engineering, Structured Generation
AI Models        : PyTorch, Transformers, Multimodal LLMs, Computer Vision, AIGC
Engineering      : Python, C++, FastAPI, Docker, Linux, GitHub Actions
Deployment       : API Integration, Model Serving, Workflow Automation

Focus Areas

LLM Agents        : Planning, tool use, workflow orchestration, structured output
RAG Systems       : Retrieval, reranking, context compression, evidence-grounded generation
Computer Vision   : Detection, segmentation, pose estimation, medical image analysis
AIGC              : Image generation, text generation, multimodal content creation
Engineering       : Python, C++, Linux, Docker, Git, model deployment

Current Direction

I am currently working on practical LLM Agent systems, multimodal AI applications, and engineering-oriented AI workflows.

My long-term interests include:

  • Building reliable and controllable LLM Agent applications
  • Improving structured generation and tool-calling reliability
  • Combining multimodal understanding with real-world AI systems
  • Deploying AI models into production-oriented workflows

Pinned

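  1. PassiveAgent (Public)

    A personal attention-scheduling system: it automatically filters high-value content from an information pool, has an LLM summarize and score it, and pushes the results to Feishu, so that you make decisions in spare moments while the system handles execution and archiving.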

    Python 1

  2. MindCtx (Public)

    A Markdown mind-map and outline editor: local-first, LLM-assisted, with dual-view editing, and compatible with Obsidian and VS Code.

    TypeScript 1

  3. lattice (Public)

    A lightweight, composable Python Agent framework designed for AI Agent algorithm research and rapid experimentation.

    Python 1

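  4. XDU_AI_project (Public)

    Third-year course projects from the 2018 undergraduate cohort at Xidian University's School of Artificial Intelligence. Most of the assignment code is public here, in the hope that it helps younger students.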

    Jupyter Notebook 18

  5. XD-AI-graduate_entrance_exam (Public)

    Experience notes on the graduate entrance exam for Xidian University's School of Artificial Intelligence.

    9 1

  6. XDUthesis-Typst (Public)

    A Typst template for Xidian University graduation theses.

    Typst 16 1