Hyogeun Oh (오효근)

🌱 Continuously Growing Engineer

With a passion for learning and growth, I have been running a technical blog since 2018 on topics like Kubernetes, PyTorch, and Airflow, achieving 1,500+ monthly active users (MAU). I have participated in 10+ study groups, enjoying exploring the core principles of technology and applying them to solve real-world challenges.

🚀 Efficiency-Maximizing Engineer

I automate inefficient repetitive tasks and seek efficient methods for operational improvement. I developed a Python backend for model serving and optimized pipelines to enhance efficiency. By utilizing GitLab CI and Kubernetes, I built pipelines and deployed using the sidecar pattern, resolving version control issues and significantly reducing build times.

🫂 Collaboration-Focused Engineer

I integrated project source codes using Git Submodule to create a seamless collaborative environment. I also introduced MLOps tools like DVC and MLflow into the team's workflow, greatly improving collaboration on data and models and fostering a productive team atmosphere.

Latest Updated 2025. 08. 10 (D+0)

Zerohertz

SKILLS

Programming

3 Python
2 C, C++
2 JavaScript, TypeScript
1 Java
1 Go

3 Python
2 C, C++

2 JavaScript, TypeScript
1 Java

1 Go

MLOps

3 Triton Inference Server
2 vLLM
2 TensorRT
2 ONNX
2 Amazon EC2 Inf1
2 DVC
1 Kubeflow
1 MLflow

3 Triton Inference Server
2 vLLM
2 TensorRT

2 ONNX
2 Amazon EC2 Inf1
2 DVC

1 Kubeflow
1 MLflow

Infra

3 Docker
3 Kubernetes
2 GitHub Actions
2 MinIO
1 Argo CD
1 Grafana
1 Prometheus
1 Traefik

3 Docker
3 Kubernetes
2 GitHub Actions

2 MinIO
1 Argo CD
1 Grafana

1 Prometheus
1 Traefik

Data Engineering

2 Apache Airflow
1 Apache Kafka

2 Apache Airflow

1 Apache Kafka

Database

2 SQL
1 Redis
1 MongoDB

2 SQL

1 Redis

1 MongoDB

Etc

Ansys
Arduino
Catia
LabVIEW
MATLAB
R
Raspberry Pi
Unreal Engine

Ansys
Arduino
Catia

LabVIEW
MATLAB
R

Raspberry Pi
Unreal Engine

EXPERIENCES

Total 2 years 7 months

2024. 12 ~

Present9 mos

GenON

Machine Learning Engineer

Designed and implemented a GPU resource scheduler, a Kubernetes resource controller, and comprehensive LLM lifecycle management modules for the GenOS LLMOps product, enhancing resource efficiency and system reliability.

Developed distributed training and serving pipelines for large language models (LLMs) and multimodal large language models (MLLMs) on a Kubernetes platform, which improved scalability and boosted performance.

Collaborated closely with the R&D team to productize cutting-edge model training code, streamlining integration and accelerating time-to-market for advanced machine learning solutions.

2024. 09 ~ 2024. 11

3 mos

Tmax WAPL

Backend Engineer

Developed backend infrastructure for WAPL, a collaboration platform, using microservices architecture (MSA) within an 11-member team.

Developed scheduling functionality utilizing a Netty-based in-house Java backend framework, supporting the efficient creation, deletion, and retrieval of schedules.

2023. 02 ~ 2024. 09

1 yr 8 mos

AgileSoDA

Machine Learning Research Engineer

Managed the entire lifecycle of machine learning services (annotation, modeling, training, deployment) and oversaw Kubernetes-based IDC infrastructure within an 11-member team.

Researched and developed models for text, signature, and checkbox detection, as well as information extraction, for the AI optical character recognition (OCR) product TwinReader.

Developed a Python backend for model serving and optimized pipelines to enhance efficiency.

Streamlined the development process by creating a solution that resolved versioning challenges and reduced build times through the separation of backend dependencies.

Executed AI projects and proof of concept (PoC) to meet client specifications.

2021. 03 ~ 2023. 02

2 yrs 0 mo

SiM Lab. (Smart intelligent Manufacturing system Laboratory)

Research Student

Researched and developed diagnostic models utilizing time-series and vision data for real-time monitoring of process conditions in industrial-scale manufacturing systems.

Published two SCI(E) Q1-ranked research papers on feature selection algorithms, advancing efficient machine learning methodologies in industrial applications.

Executed 10+ government and industry-funded projects, further enhancing research capabilities in machine learning.

2019. 11 ~ 2021. 02

1 yr 4 mos

SiM Lab. (Smart intelligent Manufacturing system Laboratory)

Research Intern

Developed bearing condition diagnostic model and graphic user interface.

2018. 06 ~ 2019. 11

1 yr 6 mos

MRV Lab. (Medical Robotics and Virtual Reality Laboratory)

Research Intern

Developed virtual reality environments based on C++ and Unreal Engine.

PROJECTS

2024. 12 ~

LLMOps Solution, GenOS

GenON

Conducted continuous evaluation and benchmarking of state-of-the-art Large Language Models (LLMs) to determine optimal computing resource requirements and configuration parameters for training and inference deployment.

Designed and developed microservices architecture (MSA) enabling distributed training and serving configuration through intuitive web-based user interfaces.

Implemented Prometheus operator integration and developed comprehensive Grafana dashboards to enhance observability and monitoring capabilities for LLM serving infrastructure.

Resolved ephemeral storage limitations by migrating from object storage to Network File System (NFS) provisioner, improving system reliability and performance.

Led strategic planning and architectural design for Model Context Protocol (MCP) feature integration and implementation.

2023. 11 ~ 2024. 12

Development and CI/CD Pipeline Construction of Python Library

Personal Project

Zerohertz/zerohertzLib (GitHub)

Python Library (PyPI)

Docs (Sphinx)

To reduce time consumption and inefficiency from reimplementing commonly used functions, developed and published a custom Python library on PyPI and GitHub Releases to enhance efficiency and code reusability across projects.

Built a GitHub Actions-based CI/CD pipeline (migrated from Jenkins) to automate repetitive tasks such as formatting, unit testing, and deployment, streamlining the process for feature additions and bug fixes.

To prevent unnecessary deployments from non-production changes like documentation updates, implemented a detailed branching strategy on GitHub and set up dedicated pipelines for code segregation.

Simplified version tracking by building a pipeline using the GitHub API to automatically generate and publish release notes to GitHub Releases, improving transparency across development cycles.

Ensured easy access to comprehensive project guidelines and function usage by creating a Sphinx-based documentation pipeline, deploying it via GitHub Pages for consistent and up-to-date project documentation.

2023. 10 ~ 2024. 09

Python Library for Pre/Post-Processing, Visualization, and Model Backend

AgileSoDA

Packaged frequently used classes and functions within the model backend into a Python library to streamline development processes.

Utilized Docstring to document functions and classes, enhancing code clarity and team collaboration, while maintaining library integrity through type hints and PyTest.

Faced with significant compatibility issues due to inconsistent libraries and formats for model outputs, standardized the data format for preprocessing and model inference visualization, enabling consistent visualization and resolving unexpected compatibility problems.

Addressed inefficiencies in post-processing due to Python-native functions with high time complexity by optimizing them with Cython-native functions and improving time complexity. (inference time decreased by 74.12%)

Developed a unified class and inheritance structure for Triton Inference Server.

2023. 02 ~ 2024. 09

AI-based OCR Solution, TwinReader

AgileSoDA

Developed models for document area detection, rotated document classification, and detection of text, signatures, and checkboxes, along with a Python backend for model deployment.

Integrated project source codes using Git Submodule to facilitate a smooth collaborative environment.

Implemented a pipeline using GitLab CI and Kubernetes to separate backend dependencies from code and weights, deploying through the Kubernetes sidecar pattern, which resolved versioning challenges and significantly reduced build times for the model backend.

Faced with excessive GPU usage during model deployment, resolved the issue by identifying and fixing a memory leak through GPU resource monitoring and logging. (GPU memory usage reduced by 47.9%)

Reduced inference time for a text detection model, where frequent calls made optimization critical, by utilizing TensorRT-based quantization. (inference time decreased by 87.31%)

Encountered low accuracy in document rotation classification, addressed by performing batch inference on image tensors rotated in four directions and averaging the results. (improved accuracy by 2.01%p)

2024. 02 ~ 2024. 05

Information Extraction Model Development for Trade Document Processing

AgileSoDA

Performed clustering, annotation, preprocessing, training, and deployment to develop a model for extracting information from a wide variety of trade document formats.

Faced with the challenge of categorizing large volumes of unstructured PDF documents, developed an AI OCR-based pipeline utilizing OCR results and LLM prompting to efficiently classify and sort documents. (achieved 93.75% accuracy)

To address the high time and cost demands of large-scale data annotation requiring expert knowledge, accelerated the process by implementing pre-labeling through an ML backend using Label Studio SDK, significantly reduced annotation time and costs.

Encountered difficulties in manually checking complex human errors during annotation review, developed a Streamlit-based GUI to allow easy detection and correction of these errors through simple configurations.

2023. 11 ~ 2024. 02

AI-based Automotive Damage and Depth Recognition Product, TwinCar

AgileSoDA

Developed a vehicle type classification model for PoC execution.

Conducted research and development models for filter, part recognition, repair type, and damage type, along with the model inference pipeline.

Developed a demo page using Streamlit and deployed it on Kubernetes.

2023. 04 ~ 2023. 11

AI Diagnostic Service for Burn Patients

AgileSoDA

Developed models for segmentation of burn areas and severity diagnosis in burn patients.

Designed and developed an API for model deployment using Triton Inference Server.

2023. 08 ~ 2023. 09

On-premise Kubernetes Cluster

Personal Project

Zerohertz/k8s-on-premise (GitHub)

Blog posts

Installed Kubernetes using Kubeadm on an on-premise environment to enhance understanding of Kubernetes architecture and practical usage.

Secured deployed services by implementing HTTPS protocol and Google OAuth2 through Traefik.

Established GitOps by automating build and deployment processes using GitHub Actions and Argo CD.

Built a node status monitoring GUI leveraging Node Exporter, Prometheus, and Grafana.

Automated various tasks using Apache Airflow integrated with KubernetesPodOperator.

Set up a Docker image build and deployment pipeline using Jenkins and Kaniko.

2023. 01 ~ 2023. 07

Mosaic Classification

BOAZ

Team-BoonMoSa (GitHub)

Developed a YOLOv5 based logo segmentation model.

Constructed a model deployment server on Amazon EC2 Inf1.

PRESENTATIONS

2025. 08

Open Source LLMs, from vLLM to Production

Instruct.KR Summer 2025 Meetup

오픈소스 LLM, vLLM으로 Production까지

GitHub

SlideShare

OPEN SOURCES

vllm-project/vllm

Led comprehensive documentation modernization initiative by converting legacy Sphinx-style directives to MkDocs-compatible format, improving cross-referencing and API documentation linking.

Resolved critical markdown rendering issues by implementing proper list indentation configuration in pymarkdown, enhancing documentation readability for MkDocs-based system.

Updated deprecation message for --enable-reasoning argument, correcting version reference from non-existent v0.8.6 to accurate v0.9.0 deprecation timeline.

Conducted comprehensive performance analysis comparing v0 vs. v1 engines and version stability (v0.8.4 vs. v0.8.5.post1) across multi-GPU environments, providing detailed technical documentation for engine optimization and issue reproduction.

fastapi/fastapi

Reviewed and enhanced the Korean FastAPI documentation by addressing missing content and improving formatting.

Fixed issues in FastAPI docs by updating code for the latest version.

microsoft/DeepSpeed

Remove duplicate declaration of pandas in Dockerfile.

bytedance/SPTSv2

Enhanced environment compatibility by removing python==3.7 from requirements.txt, enabling broader setup compatibility for SPTSv2.

Corrected variable type mismatch by aligning the depths variable to a list type for consistency with its default value, enhancing code clarity and reducing runtime errors.

Generalized configuration by implementing customizable parameters like max_length in data loading and model setup, improving SPTSv2 adaptability for varied use cases.

Optimized memory usage in inference by adding the @torch.no_grad decorator in predict.py, significantly reducing GPU memory requirements.

Resolved IndexError during training with customized data by fixing shape mismatches in GT data, ensuring stability in data handling.

Addressed tensor dimension errors and generalized prediction, evaluation, and visualization processes.

streamlit/streamlit

Identified and verified a dependency mismatch with Image.Resampling.BILINEAR (Pillow >=9.1.0).

Conducted version testing and recommended an update to Streamlit’s requirements, improving reliability for developers by preventing compatibility issues.

next-theme/hexo-theme-next

Added a motion.duration parameter in Hexo's NexT theme _config.yml, enabling flexible configuration of motion animation duration.

Modified source/js/motion.js to retrieve motion.duration dynamically, with a default value fallback for robustness.

kubernetes/website

Contributed minor wording refinements to improve grammatical accuracy.

Technical Blog

Customized a technical blog based on the Hexo NexT theme to document and share solutions to challenges encountered during personal learning and professional work.

Achieved 1,500+ MAU and 2,600+ monthly page views by consistently writing 200+ posts since 2018.

Zerohertz/awesome-jmy

Implemented an automated data ingestion and preprocessing pipeline using GitHub Actions to enhance data workflow efficiency.

Delivered insights to technical research personnel (전문연구요원) through data visualizations created with Matplotlib, supporting decision-making from multiple analytical perspectives.

Zerohertz/awesome-sgy

Implemented an automated data ingestion and preprocessing pipeline using GitHub Actions to enhance data workflow efficiency.

Delivered insights to skilled industrial personnel (산업기능요원) through data visualizations created with Matplotlib, supporting decision-making from multiple analytical perspectives.