Andrejus Baranovski

Blog about Oracle, Full Stack, Machine Learning and Cloud

The old old coolness - HTMX | Execute HTTP request directly from HTML tag

Sat, 2022-08-06 13:47
Developing a modern Web app with Django and Tailwind? Thinking of using React, Vue, or another fancy JS library for the UI? I would recommend spending some time trying HTMX first. With HTMX you can execute partial requests that return HTML content to replace regions of the web page. HTMX attributes are defined directly on HTML tags, which makes it flexible and easy to use.
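
A minimal sketch of the idea, not the code from the video: the page declares the request on the tag with the HTMX attributes hx-get, hx-target and hx-swap, and the server answers with an HTML fragment instead of a full page. The /items/ URL, the item-list element id and the render_items_fragment helper are hypothetical names chosen for illustration. In a Django project the fragment would be returned from a view.

```python
from html import escape

# Markup that would live in the page template. The hx-* attributes are real
# HTMX attributes; the URL and the element id are made up for illustration.
PAGE_SNIPPET = """
<button hx-get="/items/" hx-target="#item-list" hx-swap="innerHTML">
  Load items
</button>
<ul id="item-list"></ul>
"""


def render_items_fragment(items):
    """Return only the <li> rows; HTMX swaps them into #item-list."""
    return "\n".join(f"<li>{escape(item)}</li>" for item in items)


if __name__ == "__main__":
    # The server responds with this fragment, not with a whole page.
    print(render_items_fragment(["Invoice 1", "Receipt <2>"]))
```

The helper escapes the item text so that user-supplied values cannot inject markup into the swapped region.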


GitHub Copilot with Django Python: Create Django Model

Sat, 2022-07-30 13:45
GitHub Copilot helps save developer time: it suggests code for common utility functions quite well. There is often no need to spend time searching on Google or Stack Overflow. I love it, and it works very well with Python.


Oracle JavaScript (JET) UI rendered in single HTML file

Sun, 2022-07-24 14:26
Oracle JavaScript Extension Toolkit (JET) comes with a strong set of UI components for enterprise applications. You can reuse these components in other UI toolkits and frameworks. In this video I show how you can run JET table components in a single HTML file, without any additional configuration in separate files.


Tailwind UI in Django with Flowbite Accordion, Table, Form

Sun, 2022-07-17 14:32
I explain my experience working with Flowbite to build a prototype UI for a data mapping screen with accordion, table, and form components.


MLUI: Webpack, Django, JavaScript

Mon, 2022-07-04 08:24
This quick video explains how to run Webpack inside a Django project to package JavaScript files into a single bundle.


MLUI: Django HTML Template Tags

Sun, 2022-06-26 14:33
Django HTML template tags help implement dynamic logic that is processed on the server side and rendered on the client through the HTML response. This helps integrate the HTML structure with the Django backend. In this video I explain how to use Django's built-in tags in HTML.


MLUI: Django, Tailwind and Flowbite (Katana ML Sparrow)

Sun, 2022-06-19 15:01
This video is about integrating Tailwind and Flowbite into a Django Python project. With Tailwind and Flowbite you can build modern UIs with Django HTML templates, without using React or Vue.


MLUI: Django HTML Templates

Sun, 2022-06-12 08:54
I explain how to create a Web page in Django using HTML templates and add URL routing to a Django view.


MLUI: Django App Setup

Sun, 2022-06-05 14:20
UI plays an essential part in ML apps: it gives users access to the ML model API. With a friendly and usable UI, an ML project has a better chance of being successful. I'm building the UI for our ML product Sparrow (data extraction from documents). In this series of videos I will explain how to build a UI (including security, data model, etc.) for an ML project. Stay tuned, it will be fun and there will be lots to learn.


MLOps: Simple Kubernetes with Skipper

Tue, 2022-05-31 14:40
Kubernetes can be very complex, which is why we created Skipper. It is a simple and flexible MLOps workflow engine that helps run ML (and other) workloads on Kubernetes. Skipper comes with all the scripts to set up and run microservices infrastructure on Kubernetes. Try it and enjoy!


File Upload with PyScript

Sun, 2022-05-22 14:32
I explain how to implement file upload functionality with PyScript in the browser. The file content is converted to a Pandas DataFrame and displayed in a table. All UI components are native Python. I show how to interact with the HTML DOM structure from PyScript.


Data Annotation with SVG and JavaScript

Mon, 2022-05-16 01:35
I explain how to build a simple data annotation tool with SVG and JavaScript in an HTML page. The sample code renders two boxes in SVG on top of the receipt image. You will learn how to select and switch between annotation boxes. Enjoy!
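
The video implements this with JavaScript in the browser. The sketch below is not that code; it only illustrates the markup structure in Python, under assumptions: an image element with rect elements layered on top, and a different stroke color for the selected box. The Box class, the render_svg helper, the file name receipt.png and the colors are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Box:
    """One annotation rectangle, in image pixel coordinates."""
    box_id: str
    x: int
    y: int
    width: int
    height: int


def render_svg(image_href, width, height, boxes, selected_id=None):
    """Build SVG markup: the image first, then one <rect> per box on top."""
    parts = [
        f'<svg xmlns="http://www.w3.org/2000/svg" width="{width}" height="{height}">',
        f'  <image href="{image_href}" width="{width}" height="{height}"/>',
    ]
    for box in boxes:
        # The selected box is highlighted with a different stroke color.
        stroke = "red" if box.box_id == selected_id else "blue"
        parts.append(
            f'  <rect id="{box.box_id}" x="{box.x}" y="{box.y}" '
            f'width="{box.width}" height="{box.height}" '
            f'fill="none" stroke="{stroke}" stroke-width="2"/>'
        )
    parts.append("</svg>")
    return "\n".join(parts)


if __name__ == "__main__":
    boxes = [Box("a", 10, 20, 100, 30), Box("b", 10, 60, 100, 30)]
    print(render_svg("receipt.png", 400, 600, boxes, selected_id="b"))
```

Switching the selection here means rendering again with a different selected_id. In the browser the same effect would come from changing the stroke attribute of the clicked rect.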


PyScript - Deep Dive for Developer

Mon, 2022-05-09 01:52
PyScript was announced last week at PyCon US 2022. Good news for all Python developers: now we can run Python logic serverless in the browser. This video is a deep dive, with a step-by-step explanation of the sample application code. The application includes an input component, a chart, and a table. I explain how to update the UI when the input component changes. I hope this will be useful in your own practice.


PyScript - Python in the Browser

Mon, 2022-05-02 03:15
Exciting times! PyScript was announced at PyCon US 2022. With the PyScript framework, we can run regular Python code directly in the browser, placed inside a py-script tag. This opens lots of new possibilities for serverless Python applications, using the same API and libraries you use on the server side. Think of the browser as a VM that runs your code.


UI for ML - Django, React or Streamlit?

Tue, 2022-04-26 11:29
UI is an important part of a successful ML app. In this video I discuss the UI options I looked into for building the UI of our ML product. When deciding which UI framework or library to use, pay attention to several things, such as ease of data transfer, UI flexibility, and the ability to build user-friendly functionality.


Mindee docTR - Probably the Best Open-Source OCR

Mon, 2022-04-18 09:16
Do you want to build an ML pipeline to automate data extraction from business documents (receipts, invoices, forms)? Then your first step should be to integrate OCR for text extraction. OCR quality must be good, because the whole pipeline depends on the quality of the initial text extraction. If the extracted data is accurate, the ML models will be able to run proper classification. I spent time researching available OCR solutions, and I think Mindee docTR is currently one of the best open-source options. Check the video, where I run and show multiple tests.


Document Information Extraction Demo on Hugging Face Spaces

Mon, 2022-04-11 14:41
This video shows how a fine-tuned LayoutLMv2 document understanding and information extraction model runs in the Hugging Face Spaces demo environment. I show how data extraction works for different receipts and why you should not rely on the OCR that comes pre-configured with the LayoutLMv2 model.


Hugging Face LayoutLMv2 Model True Inference

Sun, 2022-03-27 14:33
I explain why OCR quality matters for the performance of the Hugging Face LayoutLMv2 model in document data classification. If the input from OCR is poor, the ML classification results will be low quality too. This is why it is important to use a high-quality OCR system to extract text and coordinates from the document before applying the ML solution.


Get Receipt Data with Hugging Face ML Model

Sun, 2022-03-20 10:29
This tutorial is about how to use a fine-tuned Hugging Face model to extract data from scanned receipt documents. We run inference by passing the receipt image, along with words and coordinates, to the model. As a result, we get back predictions: class labels assigned to each input. This helps to classify document elements and extract the correct data. I share a hint on how to match input words with classified labels. Input words and coordinates are expected to come from a separate OCR step.
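
The summary does not spell out the hint, so the following is only one common way to do the matching, sketched under assumptions. The model predicts a label per sub-token, while OCR returns whole words, so each word takes the label of its first sub-token. The word_ids list mirrors what the word_ids() method of a Hugging Face fast tokenizer encoding returns (None for special tokens). The labels_per_word helper, the sample words and the label names are hypothetical.

```python
def labels_per_word(words, word_ids, token_label_ids, id2label):
    """Assign each word the label predicted for its first sub-token."""
    word_labels = {}
    for word_index, label_id in zip(word_ids, token_label_ids):
        # None marks special tokens such as [CLS], [SEP] and padding.
        if word_index is None or word_index in word_labels:
            continue
        word_labels[word_index] = id2label[label_id]
    # Words that received no token (e.g. after truncation) fall back to "O".
    return [(word, word_labels.get(i, "O")) for i, word in enumerate(words)]


if __name__ == "__main__":
    # Hypothetical data: "12.50" was split into two sub-tokens.
    words = ["TOTAL", "12.50"]
    word_ids = [None, 0, 1, 1, None]
    token_label_ids = [0, 1, 2, 2, 0]
    id2label = {0: "O", 1: "TOTAL_KEY", 2: "TOTAL_PRICE"}
    print(labels_per_word(words, word_ids, token_label_ids, id2label))
```

Taking the first sub-token is a design choice; a majority vote over all sub-tokens of a word is another reasonable option.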


Fine-Tuning with Hugging Face Trainer

Sun, 2022-03-13 16:37
In this tutorial, I explain how I used Hugging Face Trainer with PyTorch to fine-tune the LayoutLMv2 model for data extraction from documents (based on the CORD dataset of receipts). The advantage of Hugging Face Trainer is that it simplifies the model fine-tuning pipeline and lets you easily upload the model to the Hugging Face model hub.