What is the best Python implementation of MapReduce — a framework or library, ideally as capable as Apache Hadoop but written purely in Python — and the best in terms of good documentation and ease of use?
Category: Hadoop
Hadoop is a distributed system infrastructure developed by the Apache Foundation. It lets users develop distributed programs without understanding the underlying details of the distributed system, and make full use of the power of clusters for high-speed computing and storage. Hadoop implements a distributed file system, the Hadoop Distributed File System (HDFS). HDFS is highly fault tolerant, is designed to be deployed on low-cost hardware, and provides high-throughput access to application data, making it suitable for applications with large data sets. HDFS relaxes some POSIX requirements and allows streaming access to data in the file system. The core design of the Hadoop framework consists of HDFS and MapReduce: HDFS provides storage for massive amounts of data, while MapReduce provides computation over it.
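The MapReduce model described above can be illustrated without a cluster at all. The following is a minimal in-memory sketch of the three phases — map, shuffle (group by key), reduce — applied to the classic word-count problem. It does not use Hadoop's APIs; it only demonstrates the dataflow that Hadoop distributes across machines.

```java
import java.util.*;

public class MiniMapReduce {
    // Map phase: emit a (word, 1) pair for every word in every input line.
    static List<Map.Entry<String, Integer>> map(List<String> lines) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String line : lines)
            for (String word : line.toLowerCase().split("\\s+"))
                if (!word.isEmpty())
                    pairs.add(new AbstractMap.SimpleEntry<>(word, 1));
        return pairs;
    }

    // Shuffle phase: group all emitted values by their key.
    static Map<String, List<Integer>> shuffle(List<Map.Entry<String, Integer>> pairs) {
        Map<String, List<Integer>> groups = new TreeMap<>();
        for (Map.Entry<String, Integer> p : pairs)
            groups.computeIfAbsent(p.getKey(), k -> new ArrayList<>()).add(p.getValue());
        return groups;
    }

    // Reduce phase: sum the counts collected for each word.
    static Map<String, Integer> reduce(Map<String, List<Integer>> groups) {
        Map<String, Integer> counts = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> g : groups.entrySet())
            counts.put(g.getKey(), g.getValue().stream().mapToInt(Integer::intValue).sum());
        return counts;
    }

    public static Map<String, Integer> wordCount(List<String> lines) {
        return reduce(shuffle(map(lines)));
    }

    public static void main(String[] args) {
        System.out.println(wordCount(Arrays.asList("hello world", "hello hadoop")));
    }
}
```

In real Hadoop, the shuffle step is performed by the framework between the Mapper and Reducer tasks, with intermediate data spilled to disk and moved across the network.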
Hadoop gen1 vs Hadoop gen2
I am a bit confused about the location of the TaskTracker in Hadoop 2.x.
The daemons in Hadoop 1.x are NameNode, DataNode, JobTracker, TaskTracker and SecondaryNameNode.
The daemons in Hadoop 2.x are NameNode, DataNode, ResourceManager, NodeManager and SecondaryNameNode: YARN's ResourceManager and per-node NodeManagers take over the roles of the JobTracker and TaskTrackers.
HBASE-JAVA-API operation table
package com.itheima;

// HBase Java API: imports needed for table operations
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.*;
import org.apache.hadoop.hbase.client.*;
import org.apache.hadoop.hbase.filter.*;
import org.apache.hadoop.hbase.util.Bytes; // the original last import was truncated; Bytes is the usual candidate in HBase examples
Hadoop – How do I check if HDFS is running?
I want to see if Hadoop's HDFS file system is working properly. I know that jps lists the running daemons, but I don't actually know which daemons to look for.
I ran the following comman
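For HDFS specifically, jps should show NameNode and DataNode (and usually SecondaryNameNode). A programmatic check is also possible via the Hadoop FileSystem API; the sketch below assumes a default Configuration whose fs.defaultFS (set in core-site.xml on the classpath) points at the NameNode.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class HdfsCheck {
    public static void main(String[] args) throws Exception {
        // Loads core-site.xml / hdfs-site.xml from the classpath;
        // fs.defaultFS must point at the NameNode (e.g. hdfs://namenode-host:8020).
        Configuration conf = new Configuration();
        try (FileSystem fs = FileSystem.get(conf)) {
            // A successful metadata call means the NameNode is up and reachable.
            boolean rootExists = fs.exists(new Path("/"));
            System.out.println("HDFS reachable, root exists: " + rootExists);
        }
    }
}
```

This only verifies NameNode reachability; reading or writing a file additionally exercises the DataNodes.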
Hadoop CDH pitfalls, part two
I can't connect to the DataNode, and I don't know why:
2019-07-19 16:10:00,156 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: hadoop102
OCP-052 Exam Question Board (21) -Cuug Interior Solution Edition
Which two are true about the Archiver (ARCn) processes?
A) They archive redo directly from the redo log buffer.
B) They are used during instance recovery.
C) They automatically delete
hadoop – Apache Hive regexp_extract UDF
I encountered a piece of code in Apache Hive, such as regexp_extract(input, '[0-9] *', 0) — can someone explain to me what this code does? Thank you. Per the Hive manual, regexp_extract(subject, pattern, index) returns the string extracted by the pattern from the subject; index selects the capture group, with 0 meaning the entire match.
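Hive's regexp_extract uses Java regular expressions under the hood, so its behavior can be mirrored in plain Java. The sketch below (a hypothetical helper, not Hive's actual code) returns the requested group of the first match, using "" where no match is found — note that some Hive versions return NULL in that case.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class RegexpExtractDemo {
    // Mirrors Hive's regexp_extract(subject, pattern, index):
    // returns group `index` of the first match, or "" if there is no match.
    static String regexpExtract(String subject, String pattern, int index) {
        Matcher m = Pattern.compile(pattern).matcher(subject);
        return m.find() ? m.group(index) : "";
    }

    public static void main(String[] args) {
        // Index 0 is the entire match; here, the leading run of digits.
        System.out.println(regexpExtract("123abc456", "[0-9]*", 0)); // prints "123"
    }
}
```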
MapReduce on Hadoop said “output file already exists”
I ran a wordcount example using MapReduce for the first time, and it worked. Then I stopped the cluster, restarted it, and followed the same steps. This error was displayed:
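This error occurs because Hadoop's FileOutputFormat refuses to overwrite an existing output directory and throws FileAlreadyExistsException. Rerunning the same job with the same output path therefore fails. The usual fix is to delete the old output directory (or choose a new path) before submitting; a sketch of that cleanup, using a hypothetical output path, is:

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class OutputDirCleanup {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Hypothetical output path; FileOutputFormat throws
        // FileAlreadyExistsException if this directory already exists.
        Path output = new Path("/user/hadoop/wordcount/output");
        FileSystem fs = FileSystem.get(conf);
        if (fs.exists(output)) {
            fs.delete(output, true); // true = delete recursively
        }
        // ...then configure the job's output as before:
        // FileOutputFormat.setOutputPath(job, output);
    }
}
```

Equivalently, the directory can be removed from the shell with `hdfs dfs -rm -r <output-path>` before rerunning.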
Iframe calling the EasyUI dialog in the parent page
Reposted from https://www.cnblogs.com/puke/archive/2012/09/13/2683067.html
I have tried this method.
This way you can get the elements of the parent page, but when calling the method of E
10. HDFS NameNode working mechanism
[TOC]
txid: the NameNode assigns a unique id, called the txid, to every operation event (add, delete, or modify). It generally increments from 0, increasing by one for each additional operation.