Simon Technology Blog

Category: Hadoop

Hadoop is a distributed system infrastructure developed by the Apache Software Foundation. It lets users write distributed programs without understanding the underlying details of distribution, making full use of a cluster's power for high-speed computing and storage. Hadoop implements a distributed file system, the Hadoop Distributed File System (HDFS). HDFS is highly fault tolerant and designed to be deployed on low-cost hardware; it provides high-throughput access to application data, which suits applications with large data sets. HDFS relaxes some POSIX requirements so that file system data can be accessed as a stream. The core of the Hadoop framework is HDFS and MapReduce: HDFS provides storage for massive amounts of data, while MapReduce provides computation over it.

Hadoop – Sqoop unable to import into table

I am running this Sqoop command:

sqoop import --connect jdbc:mysql://localhost/hadoopguide --table widgets

My Sqoop version: Sqoop 1.4.4.2.0.6.1-101
Hadoop version: Hadoop 2.2.0.2.0.6.0-101

Tak

October 12, 2021 | By Simo | Category: Hadoop | Tags: form, Hadoop, import, Sqoop, Unable
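
Sqoop 1.x long options take two ASCII hyphens (`--connect`, `--table`); a typographic dash in their place is not recognized as an option. A minimal Python sketch that assembles the argument list for the import in this post, assuming `sqoop` is on the PATH. The helper name `build_sqoop_import` is hypothetical and only illustrates the flag spelling:

```python
import shlex


def build_sqoop_import(jdbc_url, table):
    """Return the argv list for a basic `sqoop import` of one table."""
    # Each long option is spelled with two ASCII hyphens.
    return ["sqoop", "import", "--connect", jdbc_url, "--table", table]


argv = build_sqoop_import("jdbc:mysql://localhost/hadoopguide", "widgets")
# Print the command as a single shell-safe string for copy and paste.
print(shlex.join(argv))
```

The list form can be handed to `subprocess.run(argv)` without going through a shell, which avoids any re-encoding of the hyphens.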

Install Yarn live

[Summary] I recently found an open source project on GitHub and plan to study its source code. The first step, of course, is to get the project up and running. Then I took a look at the technol

October 12, 2021 | By Simo | Category: Hadoop | Tags: installation, live, Yarn

Hadoop – Apache Oozie fails to load ShareLib

I got the following oozie.log:

org.apache.oozie.service.ServiceException: E0104: Could not fully initialize service [org.apache.oozie.service.ShareLibService], Not able to cache sharelib. An

October 12, 2021 | By Simo | Category: Hadoop | Tags: apache, Failure, Hadoop, loading, Oozie, Sharelib

Some useful HBase tools and features (updated over the long term)

1. HBase stores pictures, documents and other byte stream content

https://issues.apache.org/jira/browse/HBASE-11339

2. Compaction speed control

https://issues.apache.org/jira/br

October 12, 2021 | By Simo | Category: Hadoop | Tags: Features, HBase, Long-term, Some, tools, updated, useful

Hive – Merging files in Hadoop

I ran a map job with only 674 mappers, and Hive generated 674 .gz files. I want to merge these files into 30-35 files. I did not get merged output when I tried the Hive merge mapfiles at

October 12, 2021 | By Simo | Category: Hadoop | Tags: file, Hadoop, hive, merger

Hadoop: Specify YARN Queue for DistCp

On our cluster, we set up a dynamic resource pool.

The rules are set so that YARN first looks at the specified queue, then checks the username, then checks the primary group…

But u

October 12, 2021 | By Simo | Category: Hadoop | Tags: designated, DISTCP, Hadoop, queue, Yarn
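
DistCp runs as a MapReduce job and accepts generic `-D` options, so one commonly used way to name a queue is the `mapreduce.job.queuename` property. The Python sketch below only builds the command line. The queue name and paths are placeholders, the helper `build_distcp` is hypothetical, and whether the cluster's placement rules honor the requested queue depends on the scheduler configuration, which the excerpt does not show:

```python
def build_distcp(queue, src, dst):
    """Return the argv list for a DistCp run submitted to a named queue."""
    # Generic -D options go before the source and destination paths.
    return ["hadoop", "distcp", f"-Dmapreduce.job.queuename={queue}", src, dst]


# Placeholder queue and paths, for illustration only.
cmd = build_distcp("etl", "hdfs://nn1/src", "hdfs://nn2/dst")
print(" ".join(cmd))
```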

Hadoop – How to Create a Spark DataFrame from a SequenceFile

I am using Spark 1.5. I want to create a DataFrame from a file in HDFS. The HDFS file contains JSON data with a large number of fields, stored in the SequenceFile input format.

Is there a way to do t

October 12, 2021 | By Simo | Category: Hadoop | Tags: create, DataFrame, Hadoop, How, Sequencefile, SPARK

Hadoop cluster time synchronization

1. Cluster time synchronization
Pick one machine to act as the time server. All other machines in the cluster synchronize with it regularly, for example every ten minutes.
1.1 Steps

October 12, 2021 | By Simo | Category: Hadoop | Tags: cluster, Hadoop, Synchronization, time
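
The ten-minute schedule mentioned in this post maps naturally onto a cron entry. The sketch below generates such a line in Python, under several assumptions not stated in the excerpt: that the nodes use `ntpdate`, that it lives at `/usr/sbin/ntpdate`, and that the time server is reachable as `timeserver.example` (a placeholder). The helper `ntp_cron_line` is hypothetical:

```python
def ntp_cron_line(server, every_minutes=10):
    """Return a crontab entry that runs ntpdate against `server` periodically."""
    # A */N minute field only yields even spacing when N divides 60.
    if not 1 <= every_minutes <= 59 or 60 % every_minutes != 0:
        raise ValueError("every_minutes must divide 60 evenly")
    # Assumed binary path; adjust to wherever ntpdate is installed.
    return f"*/{every_minutes} * * * * /usr/sbin/ntpdate {server}"


print(ntp_cron_line("timeserver.example"))
```

The generated line would go into the crontab of every node except the time server itself.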

YARN multi-tenant management

YARN multi-tenant configuration management (CapacityScheduler); the Hadoop version is 2.7

1. Before multi-tenancy is implemented, there is only one default queue

2. Configuration file m

October 12, 2021 | By Simo | Category: Hadoop | Tags: management, More, tenant, Yarn
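
Moving from the single default queue to several tenant queues under the CapacityScheduler comes down to listing the queues in `yarn.scheduler.capacity.root.queues` and giving each a `yarn.scheduler.capacity.root.<queue>.capacity` share, with the shares at one level summing to 100. A minimal Python sketch that emits such a `capacity-scheduler.xml` fragment follows. The tenant names and percentages are invented for illustration, and the post's actual configuration is truncated, so this is not the author's file:

```python
import xml.etree.ElementTree as ET


def capacity_scheduler_xml(queues):
    """Build capacity-scheduler.xml content for top-level queues.

    `queues` maps queue name to capacity percentage; values must sum to 100.
    """
    if sum(queues.values()) != 100:
        raise ValueError("top-level queue capacities must sum to 100")
    # One property lists the child queues of root, one per queue sets its share.
    props = {"yarn.scheduler.capacity.root.queues": ",".join(queues)}
    for name, pct in queues.items():
        props[f"yarn.scheduler.capacity.root.{name}.capacity"] = str(pct)
    root = ET.Element("configuration")
    for key, value in props.items():
        prop = ET.SubElement(root, "property")
        ET.SubElement(prop, "name").text = key
        ET.SubElement(prop, "value").text = value
    return ET.tostring(root, encoding="unicode")


# Hypothetical tenants: keep the default queue and add two tenant queues.
xml_text = capacity_scheduler_xml({"default": 40, "tenant_a": 30, "tenant_b": 30})
print(xml_text)
```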

HBase inserts and queries are slow

Symptom: inserting into and querying HBase is slow

Check the HBase node status and find that it is running normally:

Check the status of accessing the HBase service and find that t

October 12, 2021 | By Simo | Category: Hadoop | Tags: all slow, HBase, INSERT, query, speed

© Simon Technology Blog 2025 • ThemeCountry Powered by WordPress