Apache Tez (Oct 31, 2020)
You might have heard about Tez a lot if you are using the HDP distribution of Hive, but if you are new to HDP/CDP or have used Hive on MR only…
Hive Datanucleus ConnectionPool (Oct 29, 2020)
ConnectionPool: A “connection pool” is a cache of database connection objects. Connection pools promote the reuse of connection objects…
Locks in Hive (Oct 26, 2020)
Databases have locks, and Hive is no different. A lock in a database can be either a read lock or a write lock. Locks are used when concurrent…
Hive LLAP Caching (Oct 26, 2020)
The cache is used mostly for BI queries rather than ETL queries. hive.llap.io.threadpool.size is set at the node level and defines the…
Hive 3 (Architecture) (Oct 26, 2020)
Hive 3 has seen a lot of architectural changes, such as ACID as the default table type, deprecation of the Hive CLI (thick JDBC client), and only…
Hadoop YARN ATS REST APIs (Oct 25, 2020)
To fetch a YARN application’s data, we can use the REST APIs on ATS. Below are some reference links:
ORC Split Strategies (Hive) (Oct 25, 2020)
Which strategy should ORC use to create splits for execution? The available options are “BI”, “ETL” and “HYBRID”.