INTRODUCTION

🌱 Continuously Growing Engineer

With a passion for learning and growth, I have been running a technical blog since 2018 on topics like Kubernetes, PyTorch, and Airflow, achieving 1,500+ monthly active users (MAU). I have participated in 10+ study groups, enjoying exploring the core principles of technology and applying them to solve real-world challenges.

🚀 Efficiency-Maximizing Engineer

I automate inefficient repetitive tasks and seek efficient methods for operational improvement. I developed a Python backend for model serving and optimized pipelines to enhance efficiency. By utilizing GitLab CI and Kubernetes, I built pipelines and deployed using the sidecar pattern, resolving version control issues and significantly reducing build times.

🫂 Collaboration-Focused Engineer

I integrated project source codes using Git Submodule to create a seamless collaborative environment. I also introduced MLOps tools like DVC and MLflow into the team's workflow, greatly improving collaboration on data and models and fostering a productive team atmosphere.

Latest Updated 2024. 11. 21 (D+0)

Zerohertz

SKILLS

Programming

  • 3 Python
  • 3 MATLAB
  • 2 Java
  • 2 C, C++
  • 2 Cython
  • 2 R
  • 1 Go
  • 3 Python
  • 3 MATLAB
  • 2 Java
  • 2 C, C++
  • 2 Cython
  • 2 R
  • 1 Go

MLOps

  • 3 Triton Inference Server
  • 2 TensorRT
  • 2 ONNX
  • 2 Amazon EC2 Inf1
  • 2 DVC
  • 1 Kubeflow
  • 1 MLflow
  • 3 Triton Inference Server
  • 2 TensorRT
  • 2 ONNX
  • 2 Amazon EC2 Inf1
  • 2 DVC
  • 1 Kubeflow
  • 1 MLflow

Infra

  • 3 Docker
  • 3 Kubernetes
  • 2 GitHub Actions
  • 2 MinIO
  • 1 Argo CD
  • 1 Grafana
  • 1 Prometheus
  • 1 Traefik
  • 3 Docker
  • 3 Kubernetes
  • 2 GitHub Actions
  • 2 MinIO
  • 1 Argo CD
  • 1 Grafana
  • 1 Prometheus
  • 1 Traefik

Data Engineering

  • 2 Apache Airflow
  • 1 Apache Kafka
  • 2 Apache Airflow
  • 1 Apache Kafka

    Database

    • 2 PostgreSQL
    • 1 MySQL
    • 1 Redis
    • 2 PostgreSQL
    • 1 MySQL
    • 1 Redis

    Etc

    • Ansys
    • Arduino
    • Catia
    • LabVIEW
    • Raspberry Pi
    • Unreal Engine
    • Ansys
    • Arduino
    • Catia
    • LabVIEW
    • Raspberry Pi
    • Unreal Engine

    EXPERIENCES

    Total 1 year 11 months
    Total 1 year 11 months

    2024. 09 ~

    Present3 mos

    Tmax WAPL

    Backend Engineer
    https://img.shields.io/badge/Java-white?style=flat&logo=coffeescript&logoColor=white&color=c04545https://img.shields.io/badge/Tibero-white?style=flat&logo=oracle&logoColor=white&color=c04545https://img.shields.io/badge/Docker-white?style=flat&logo=Docker&logoColor=white&color=c04545https://img.shields.io/badge/Kubernetes-white?style=flat&logo=Kubernetes&logoColor=white&color=c04545
    • Developed backend infrastructure for WAPL, a collaboration platform, using microservices architecture (MSA) within an 11-member team.

    • Developed scheduling functionality utilizing a Netty-based in-house Java backend framework, supporting the efficient creation, deletion, and retrieval of schedules.


    2023. 02 ~ 2024. 09

    1 yr 8 mos

    AgileSoDA

    Machine Learning Research Engineer
    https://img.shields.io/badge/Python-white?style=flat&logo=Python&logoColor=white&color=c04545https://img.shields.io/badge/Docker-white?style=flat&logo=Docker&logoColor=white&color=c04545https://img.shields.io/badge/Kubernetes-white?style=flat&logo=Kubernetes&logoColor=white&color=c04545https://img.shields.io/badge/Research-white?style=flat&logo=internetarchive&logoColor=white&color=c04545https://img.shields.io/badge/Machine Learning-white?style=flat&logo=openai&logoColor=white&color=c04545
    • Managed the entire lifecycle of machine learning services (annotation, modeling, training, deployment) and oversaw Kubernetes-based IDC infrastructure within an 11-member team.

    • Researched and developed models for text, signature, and checkbox detection, as well as information extraction, for the AI optical character recognition (OCR) product TwinReader.

    • Developed a Python backend for model serving and optimized pipelines to enhance efficiency.

    • Streamlined the development process by creating a solution that resolved versioning challenges and reduced build times through the separation of backend dependencies.

    • Executed AI projects and proof of concept (PoC) to meet client specifications.


    2021. 03 ~ 2023. 02

    2 yrs 0 mo

    SiM Lab. (Smart intelligent Manufacturing system Laboratory)

    Research Student
    https://img.shields.io/badge/Python-white?style=flat&logo=Python&logoColor=white&color=c04545https://img.shields.io/badge/C++-white?style=flat&logo=C%2B%2B&logoColor=white&color=c04545https://img.shields.io/badge/MATLAB-white?style=flat&logo=mcdonalds&logoColor=white&color=c04545https://img.shields.io/badge/LabVIEW-white?style=flat&logo=LabVIEW&logoColor=white&color=c04545https://img.shields.io/badge/Ansys-white?style=flat&logo=Ansys&logoColor=white&color=c04545https://img.shields.io/badge/Machine Learning-white?style=flat&logo=openai&logoColor=white&color=c04545
    • Researched and developed diagnostic models utilizing time-series and vision data for real-time monitoring of process conditions in industrial-scale manufacturing systems.

    • Published two SCI(E) Q1-ranked research papers on feature selection algorithms, advancing efficient machine learning methodologies in industrial applications.

    • Executed 10+ government and industry-funded projects, further enhancing research capabilities in machine learning.


    2019. 11 ~ 2021. 02

    1 yr 4 mos

    SiM Lab. (Smart intelligent Manufacturing system Laboratory)

    Research Intern
    https://img.shields.io/badge/MATLAB-white?style=flat&logo=mcdonalds&logoColor=white&color=c04545https://img.shields.io/badge/Research-white?style=flat&logo=internetarchive&logoColor=white&color=c04545https://img.shields.io/badge/Machine Learning-white?style=flat&logo=openai&logoColor=white&color=c04545
    • Developed bearing condition diagnostic model and graphic user interface.


    2018. 06 ~ 2019. 11

    1 yr 6 mos

    MRV Lab. (Medical Robotics and Virtual Reality Laboratory)

    Research Intern
    https://img.shields.io/badge/Unreal Engine-white?style=flat&logo=Unreal Engine&logoColor=white&color=c04545https://img.shields.io/badge/C++-white?style=flat&logo=C%2B%2B&logoColor=white&color=c04545
    • Developed virtual reality environments based on C++ and Unreal Engine.

    PROJECTS

    2023. 11 ~

    Development and CI/CD Pipeline Construction of Python Library

    Personal Project
    https://img.shields.io/badge/Python-white?style=flat&logo=Python&logoColor=white&color=c04545https://img.shields.io/badge/PyTest-white?style=flat&logo=PyTest&logoColor=white&color=c04545https://img.shields.io/badge/Sphinx-white?style=flat&logo=Sphinx&logoColor=white&color=c04545https://img.shields.io/badge/Jenkins-white?style=flat&logo=Jenkins&logoColor=white&color=c04545https://img.shields.io/badge/GitHub Actions-white?style=flat&logo=GitHub Actions&logoColor=white&color=c04545https://img.shields.io/badge/Codecov-white?style=flat&logo=Codecov&logoColor=white&color=c04545https://img.shields.io/badge/PyPI-white?style=flat&logo=PyPI&logoColor=white&color=c04545
    • Zerohertz/zerohertzLib (GitHub)
    • Python Library (PyPI)
    • Docs (Sphinx)
    • To reduce time consumption and inefficiency from reimplementing commonly used functions, developed and published a custom Python library on PyPI and GitHub Releases to enhance efficiency and code reusability across projects.

      images/dacp-0.png
    • Built a GitHub Actions-based CI/CD pipeline (migrated from Jenkins) to automate repetitive tasks such as formatting, unit testing, and deployment, streamlining the process for feature additions and bug fixes.

      images/dacp-1.png
    • To prevent unnecessary deployments from non-production changes like documentation updates, implemented a detailed branching strategy on GitHub and set up dedicated pipelines for code segregation.

      images/dacp-2.png
    • Simplified version tracking by building a pipeline using the GitHub API to automatically generate and publish release notes to GitHub Releases, improving transparency across development cycles.

    • Ensured easy access to comprehensive project guidelines and function usage by creating a Sphinx-based documentation pipeline, deploying it via GitHub Pages for consistent and up-to-date project documentation.

      images/dacp-3.png

    2023. 10 ~ 2024. 09

    Python Library for Pre/Post-Processing, Visualization, and Model Backend

    AgileSoDA
    https://img.shields.io/badge/Python-white?style=flat&logo=Python&logoColor=white&color=c04545https://img.shields.io/badge/PyTest-white?style=flat&logo=PyTest&logoColor=white&color=c04545https://img.shields.io/badge/Sphinx-white?style=flat&logo=Sphinx&logoColor=white&color=c04545https://img.shields.io/badge/Numpy-white?style=flat&logo=Numpy&logoColor=white&color=c04545https://img.shields.io/badge/OpenCV-white?style=flat&logo=OpenCV&logoColor=white&color=c04545https://img.shields.io/badge/PyTorch-white?style=flat&logo=PyTorch&logoColor=white&color=c04545https://img.shields.io/badge/Triton Inference Server-white?style=flat&logo=nvidia&logoColor=white&color=c04545
    • Packaged frequently used classes and functions within the model backend into a Python library to streamline development processes.

    • Utilized Docstring to document functions and classes, enhancing code clarity and team collaboration, while maintaining library integrity through type hints and PyTest.

    • Faced with significant compatibility issues due to inconsistent libraries and formats for model outputs, standardized the data format for preprocessing and model inference visualization, enabling consistent visualization and resolving unexpected compatibility problems.

    • Addressed inefficiencies in post-processing due to Python-native functions with high time complexity by optimizing them with Cython-native functions and improving time complexity. (inference time decreased by 74.12%)

      images/plfp-0.png
    • Developed a unified class and inheritance structure for Triton Inference Server.


    2023. 02 ~ 2024. 09

    AI-based OCR Solution, TwinReader

    AgileSoDA
    https://img.shields.io/badge/Python-white?style=flat&logo=Python&logoColor=white&color=c04545https://img.shields.io/badge/PyTorch-white?style=flat&logo=PyTorch&logoColor=white&color=c04545https://img.shields.io/badge/Docker-white?style=flat&logo=Docker&logoColor=white&color=c04545https://img.shields.io/badge/Kubernetes-white?style=flat&logo=Kubernetes&logoColor=white&color=c04545https://img.shields.io/badge/Triton Inference Server-white?style=flat&logo=nvidia&logoColor=white&color=c04545https://img.shields.io/badge/GitLab-white?style=flat&logo=GitLab&logoColor=white&color=c04545https://img.shields.io/badge/DVC-white?style=flat&logo=DVC&logoColor=white&color=c04545https://img.shields.io/badge/MinIO-white?style=flat&logo=MinIO&logoColor=white&color=c04545
    • Developed models for document area detection, rotated document classification, and detection of text, signatures, and checkboxes, along with a Python backend for model deployment.

    • Integrated project source codes using Git Submodule to facilitate a smooth collaborative environment.

    • Implemented a pipeline using GitLab CI and Kubernetes to separate backend dependencies from code and weights, deploying through the Kubernetes sidecar pattern, which resolved versioning challenges and significantly reduced build times for the model backend.

      images/abos-0.png
    • Faced with excessive GPU usage during model deployment, resolved the issue by identifying and fixing a memory leak through GPU resource monitoring and logging. (GPU memory usage reduced by 47.9%)

      images/abos-1.png
    • Reduced inference time for a text detection model, where frequent calls made optimization critical, by utilizing TensorRT-based quantization. (inference time decreased by 87.31%)

      images/abos-2.png
    • Encountered low accuracy in document rotation classification, addressed by performing batch inference on image tensors rotated in four directions and averaging the results. (improved accuracy by 2.01%p)

      images/abos-3.png

    2024. 02 ~ 2024. 05

    Information Extraction Model Development for Trade Document Processing

    AgileSoDA
    https://img.shields.io/badge/Python-white?style=flat&logo=Python&logoColor=white&color=c04545https://img.shields.io/badge/PyTorch-white?style=flat&logo=PyTorch&logoColor=white&color=c04545https://img.shields.io/badge/Docker-white?style=flat&logo=Docker&logoColor=white&color=c04545https://img.shields.io/badge/Kubernetes-white?style=flat&logo=Kubernetes&logoColor=white&color=c04545https://img.shields.io/badge/Triton Inference Server-white?style=flat&logo=nvidia&logoColor=white&color=c04545https://img.shields.io/badge/Label Studio-white?style=flat&logo=materialdesignicons&logoColor=white&color=c04545https://img.shields.io/badge/Streamlit-white?style=flat&logo=Streamlit&logoColor=white&color=c04545
    • Performed clustering, annotation, preprocessing, training, and deployment to develop a model for extracting information from a wide variety of trade document formats.

    • Faced with the challenge of categorizing large volumes of unstructured PDF documents, developed an AI OCR-based pipeline utilizing OCR results and LLM prompting to efficiently classify and sort documents. (achieved 93.75% accuracy)

    • To address the high time and cost demands of large-scale data annotation requiring expert knowledge, accelerated the process by implementing pre-labeling through an ML backend using Label Studio SDK, significantly reduced annotation time and costs.

    • Encountered difficulties in manually checking complex human errors during annotation review, developed a Streamlit-based GUI to allow easy detection and correction of these errors through simple configurations.


    2023. 11 ~ 2024. 02

    AI-based Automotive Damage and Depth Recognition Product, TwinCar

    AgileSoDA
    https://img.shields.io/badge/Python-white?style=flat&logo=Python&logoColor=white&color=c04545https://img.shields.io/badge/PyTorch-white?style=flat&logo=PyTorch&logoColor=white&color=c04545https://img.shields.io/badge/Docker-white?style=flat&logo=Docker&logoColor=white&color=c04545https://img.shields.io/badge/Kubernetes-white?style=flat&logo=Kubernetes&logoColor=white&color=c04545https://img.shields.io/badge/Triton Inference Server-white?style=flat&logo=nvidia&logoColor=white&color=c04545https://img.shields.io/badge/Streamlit-white?style=flat&logo=Streamlit&logoColor=white&color=c04545
    • Developed a vehicle type classification model for PoC execution.

    • Conducted research and development models for filter, part recognition, repair type, and damage type, along with the model inference pipeline.

    • Developed a demo page using Streamlit and deployed it on Kubernetes.


    2023. 04 ~ 2023. 11

    AI Diagnostic Service for Burn Patients

    AgileSoDA
    https://img.shields.io/badge/Python-white?style=flat&logo=Python&logoColor=white&color=c04545https://img.shields.io/badge/PyTorch-white?style=flat&logo=PyTorch&logoColor=white&color=c04545https://img.shields.io/badge/Docker-white?style=flat&logo=Docker&logoColor=white&color=c04545https://img.shields.io/badge/Kubernetes-white?style=flat&logo=Kubernetes&logoColor=white&color=c04545https://img.shields.io/badge/Triton Inference Server-white?style=flat&logo=nvidia&logoColor=white&color=c04545https://img.shields.io/badge/FastAPI-white?style=flat&logo=FastAPI&logoColor=white&color=c04545https://img.shields.io/badge/Gradio-white?style=flat&logo=huggingface&logoColor=white&color=c04545
    • Developed models for segmentation of burn areas and severity diagnosis in burn patients.

    • Designed and developed an API for model deployment using Triton Inference Server.

      images/adsf-0.png

    2023. 08 ~ 2023. 09

    On-premise Kubernetes Cluster

    Personal Project
    https://img.shields.io/badge/Kubernetes-white?style=flat&logo=Kubernetes&logoColor=white&color=c04545https://img.shields.io/badge/Traefik-white?style=flat&logo=traefikproxy&logoColor=white&color=c04545https://img.shields.io/badge/Prometheus-white?style=flat&logo=Prometheus&logoColor=white&color=c04545https://img.shields.io/badge/Grafana-white?style=flat&logo=Grafana&logoColor=white&color=c04545https://img.shields.io/badge/Apache Airflow-white?style=flat&logo=Apache Airflow&logoColor=white&color=c04545https://img.shields.io/badge/Argo CD-white?style=flat&logo=argo&logoColor=white&color=c04545https://img.shields.io/badge/Jenkins-white?style=flat&logo=Jenkins&logoColor=white&color=c04545
    • Zerohertz/k8s-on-premise (GitHub)
      images/opkc-0.png
    • Blog posts
      images/opkc-1.png
    • Installed Kubernetes using Kubeadm on an on-premise environment to enhance understanding of Kubernetes architecture and practical usage.

    • Secured deployed services by implementing HTTPS protocol and Google OAuth2 through Traefik.

    • Established GitOps by automating build and deployment processes using GitHub Actions and Argo CD.

    • Built a node status monitoring GUI leveraging Node Exporter, Prometheus, and Grafana.

    • Automated various tasks using Apache Airflow integrated with KubernetesPodOperator.

    • Set up a Docker image build and deployment pipeline using Jenkins and Kaniko.


    2023. 01 ~ 2023. 07

    Mosaic Classification

    BOAZ
    https://img.shields.io/badge/Python-white?style=flat&logo=Python&logoColor=white&color=c04545https://img.shields.io/badge/PyTorch-white?style=flat&logo=PyTorch&logoColor=white&color=c04545https://img.shields.io/badge/Docker-white?style=flat&logo=Docker&logoColor=white&color=c04545https://img.shields.io/badge/Triton Inference Server-white?style=flat&logo=nvidia&logoColor=white&color=c04545https://img.shields.io/badge/Amazon EC2-white?style=flat&logo=amazonec2&logoColor=white&color=c04545https://img.shields.io/badge/FastAPI-white?style=flat&logo=FastAPI&logoColor=white&color=c04545
    • Team-BoonMoSa (GitHub)
      images/mc-0.gif
    • Developed a YOLOv5 based logo segmentation model.

    • Constructed a model deployment server on Amazon EC2 Inf1.

      images/mc-1.png

    PUBLICATIONS & PATENTS

    2023. 09


    2023. 01


    2022. 04

    OPEN SOURCES

    • Customized a technical blog based on the Hexo NexT theme to document and share solutions to challenges encountered during personal learning and professional work.

    • Achieved 1,500 MAU and 2,600 monthly page views by consistently writing 200+ posts since 2018.


    • Implemented an automated data ingestion and preprocessing pipeline using GitHub Actions to enhance data workflow efficiency.

    • Delivered insights to technical research personnel (전문연구요원) through data visualizations created with Matplotlib, supporting decision-making from multiple analytical perspectives.


    • Implemented an automated data ingestion and preprocessing pipeline using GitHub Actions to enhance data workflow efficiency.

    • Delivered insights to skilled industrial personnel (산업기능요원) through data visualizations created with Matplotlib, supporting decision-making from multiple analytical perspectives.


    • Identified and verified a dependency mismatch with Image.Resampling.BILINEAR (Pillow >=9.1.0).

    • Conducted version testing and recommended an update to Streamlit’s requirements, improving reliability for developers by preventing compatibility issues.


    • Enhanced environment compatibility by removing python==3.7 from requirements.txt, enabling broader setup compatibility for SPTSv2.

    • Corrected variable type mismatch by aligning the depths variable to a list type for consistency with its default value, enhancing code clarity and reducing runtime errors.

    • Generalized configuration by implementing customizable parameters like max_length in data loading and model setup, improving SPTSv2 adaptability for varied use cases.

    • Optimized memory usage in inference by adding the @torch.no_grad decorator in predict.py, significantly reducing GPU memory requirements.

    • Resolved IndexError during training with customized data by fixing shape mismatches in GT data, ensuring stability in data handling.

    • Addressed tensor dimension errors and generalized prediction, evaluation, and visualization processes.


    • Added a motion.duration parameter in Hexo's NexT theme _config.yml, enabling flexible configuration of motion animation duration.

    • Modified source/js/motion.js to retrieve motion.duration dynamically, with a default value fallback for robustness.


    • Contributed minor wording refinements to improve grammatical accuracy.

    EDUCATIONS

    2021. 03 ~ 2023. 02


    2017. 03 ~ 2021. 02

    Konkuk University, Seoul, Korea

    B.S. in Department of Mechanical Engineering

    EXTRAS

    2024. 05 ~ 2026. 05


    2023. 02 ~ 2026. 02

    전문연구요원

    AgileSoDA, Tmax WAPL