
CFOCoder

  • Current Page: Home
  • About Me
  • LinkedIn
  • Resume
  • Contact
  • Power BI Dashboard Samples

Querying Apache Hive from DBeaver: Starting HiveServer2 and Connecting a Desktop SQL Client

Part 6 in the Hadoop and Hive Tutorial Series

In the previous posts of this series, I installed Hadoop 3.3.6 natively on Ubuntu, configured YARN, ran MapReduce jobs, installed Apache Hive 3.1.3 on top of Hadoop, and finally loaded CSV files into Hive external tables so they could be queried...

March 24, 2026 by Hector Sanchez Hadoop

From HDFS to SQL Queries: Loading CSV Files into Hive External Tables and Querying with SQL

Part 5 in the Hadoop and Hive Tutorial Series

Introduction

When I completed the installation of Hadoop 3.3.6 and Apache Hive 3.1.3 on my Ubuntu machine, I had everything running smoothly. But then came a practical question that every data engineer faces: How do I actually get data into this system...

March 18, 2026 by Hector Sanchez Hadoop

Restic + MinIO for OpenClaw: What It Is, What It Solves, and the Quick Reference I Wanted Yesterday

A bit of personal context

Yesterday I spent part of the day optimizing my OpenClaw setup and cleaning up the way I protect its operational state. At one point, I realized something important: the local workspace was no longer just “scratch space.” It already contained memory, credentials, agent configuration, scripts,...

March 16, 2026 by Hector Sanchez Linux

Building a Modern Frontier Data Stack: Hadoop 3.4.3, Hive 4.2.0, and MinIO S3 Integration in 2026

A bit of personal context

A few days ago, I published posts about how to install Hadoop 3.3.6 natively on Ubuntu. At that time, I thought it was the state of the art. But things in the Big Data world move fast. Fast forward a few days, and when I...

March 12, 2026 by Hector Sanchez Hadoop

Apache Hive 3.1.3 on Ubuntu: Native Installation on Top of Hadoop 3.3.6

In Part 1 of this series, I installed Hadoop 3.3.6 natively on Ubuntu 24.04 and configured HDFS in pseudo-distributed mode. In Part 2, I configured YARN and ran the canonical WordCount job on War and Peace. In Part 3, I improved the text processing pipeline by normalizing words before counting them. The natural next...

March 10, 2026 by Hector Sanchez Hadoop
Hector Sanchez
Public Accountant, Financial Analyst and Analytics Engineer

Categories

  • AI (13)
  • Blog (1)
  • Cloud (15)
  • Data Science (15)
  • Hadoop (7)
  • Linux (30)
  • Optimization (1)
  • Python (2)
  • SQL (5)


CFOCoder © 2026. All Rights Reserved.
