Hash scraper technology blog

Understanding Advanced XPath Content: Wildcards, Nodes, Functions

Understanding Advanced XPath Content: Wildcards, Nodes, Functions

0. Overview In XPath Part 1, we covered the essential concepts and basic syntax of XPath necessary for web crawling. In Part 2, we will delve into advanced topics of XPath. 1. Understanding Wildcar...

Read more →
Exploring XPath: Basics of Web Crawling

Exploring XPath: Basics of Web Crawling

0. What is XPath? XPath is an abbreviation for 'XML Path Language', a language for specifying paths to access specific elements or attributes in an XML document. XPath is commonly used in web crawl...

Read more →
Automating web browsers using Microsoft Playwright

Automating web browsers using Microsoft Playwright

0. What is Playwright? Playwright is an open-source library developed by Microsoft for automating web browsers. Although it was not originally designed for web scraping, it demonstrates excellent p...

Read more →
Automating web crawling using Python: schedule, Task Scheduler, crontab

Automating web crawling using Python: schedule, Task Scheduler, crontab

0. Web Crawling, Manual Execution Too Troublesome? Have you found it cumbersome to manually execute web crawling code? We introduce a method for Python code to run automatically at desired times an...

Read more →
Indexing techniques and best practices related to SEO optimization.

Indexing techniques and best practices related to SEO optimization.

1. Check for Issues with Google Search Console After verifying the domain, once the crawling bot starts crawling, data will be visible. Please check back in a few days to see the data. Common error...

Read more →
A case study of a customer who dramatically improved their defect rate (feat. SHAP library)

A case study of a customer who dramatically improved their defect rate (feat. SHAP library)

0. Overview One of the Hashscraper customer cases used an AI model to reduce the defect rate in the process. I wrote this article to explain the predictions of the machine learning model in a more ...

Read more →
There is a company that dramatically reduced the defect rate in the process using an AI model? (feat. SHAP library)

There is a company that dramatically reduced the defect rate in the process using an AI model? (feat. SHAP library)

0. Overview One of the Hashscraper customer cases used an AI model to reduce the defect rate in the process. I have written this article to explain the predictions of the machine learning model mor...

Read more →
Easy Handling of AWS: Tips for Beginner Developers

Easy Handling of AWS: Tips for Beginner Developers

0. Is AWS (Amazon Web Services) just too complicated? As much as it is full of useful features, Amazon Web Services (AWS) is notorious for its high level of difficulty for beginner developers. Do y...

Read more →
Creating a Coupang crawling bot with ChatGPT: Extracting product information from search results

Creating a Coupang crawling bot with ChatGPT: Extracting product information from search results

0. Overview These days, thanks to ChatGPT, the world has become much easier to develop. Can we easily create a web scraping bot with ChatGPT as well? Let's start developing a Coupang web scraping b...

Read more →

Get notified of new posts

We'll email you when 해시스크래퍼 기술 블로그 publishes new content.

Your email will only be used for new post notifications.