Hi there, thanks for visiting my career page.
I'm Harrison, a senior software and machine learning engineer with eight years of experience, including three years specializing in applied machine learning and generative AI platform engineering.
I am passionate about building generative AI systems, training large language models (LLMs), and developing MLOps infrastructure for production-grade deployment.
I have designed and deployed a production-grade FastAPI microservice that powers text generation and embeddings, integrating LLM provider models into the API with support for streaming responses.
To view and interact with an application backed by an OpenAI LLM, click here.
In the near future, a demo of distributed GPU vLLM serving and KServe deployments will also be hosted.