Skip to navigation Skip to content
Simon Technology Blog
  • Architecture
  • Cloud
  • Database
  • Develop
  • Hardware
  • Industry
  • Language
  • Mobile
  • Opensource
  • OS
  • Web
Main Navigation

Category: Hadoop

Hadoop is a distributed system infrastructure developed by the Apache Foundation. Users can develop distributed programs without understanding the underlying details of distributed. Make full use of the power of clusters for high-speed computing and storage. Hadoop implements a distributed file system (Hadoop Distributed File System), referred to as HDFS. HDFS has the characteristics of high fault tolerance and is designed to be deployed on low-cost hardware; and it provides high throughput (high throughput) to access application data, suitable for those with large data sets (large data sets). set) application. HDFS relaxes the requirements of POSIX and can access data in the file system in the form of streaming access. The core design of the Hadoop framework is: HDFS and MapReduce. HDFS provides storage for massive amounts of data, while MapReduce provides calculations for massive amounts of data.

Hive small knot

(1) Basic introduction to hive

Hive is a data warehouse tool based on Hadoop, which can map structured data files to a database table and provide SQL-like query functions

Other knowledg

October 12, 2021By Simo Hadoop hive, small knotLeave a Comment

Build a nuXT project (with Element UI, AXIOS)

Using Nuxt Nuxt.js document: https://zh.nuxtjs.org/guide/

npx create-nuxt-app

// or yarn create nuxt-app Run h3> npm run dev Routing Basic routing Nuxt.js is based on pages The directory

October 12, 2021By Simo Hadoop AXIOS, build, element, match, Nuxt, project, UILeave a Comment

Hive Basics 1

hadoop hive (emphasis) day-6

1) Hive metadata (similar to tables, column lengths, etc.) storage On mysql

1)create table table_name like old_table //only table

2)create table table

October 12, 2021By Simo Hadoop foundation, hive, KnowledgeLeave a Comment

How to use Spring data with couchbase without the _class property

Is there an easy way to use spring data couchbase with documents without the _class attribute?
In the sofa base, there is something like this in my sampling database:

{
“username”: “alice”,

October 12, 2021By Simo Hadoop Couchbase, data, how to use, springLeave a Comment

HBase (3) – Build full-distributed

HBase set up–Fully-distributed 1, instructions for building methods By default, HBase runs in standalone mode. Both standalone mode and pseudo-distributed mode are provided for the purposes of sm

October 12, 2021By Simo Hadoop built, distributed, Fully, HBase, ThirdLeave a Comment

(Heavy pound) fastest Hadoop fully distributed operation

1. Prepare the virtual machine Clone 3 linux virtual machines, only the machine with centos minimal mode installed

Network allocation table

Host name

IP address

hadoop1

<

August 22, 2021By Simo Hadoop complete, distributed, fastest, Hadoop, Heavy pound, runLeave a Comment

Beeline Link Hive error

Look at the problem: Beeline reports an error when connecting to hiveserver2. Connection string: hive –service beeline -u jdbc:hive2://s1:10000/hive

Error: Error: Could not open client trans

August 22, 2021By Simo Hadoop Beeline, error, hive, linkLeave a Comment

9, Hadoop-HDFS Overview

1. Background and definition of HDFS generation Background generation

As the amount of data becomes larger and larger, it is stored in a system If you don’t have all the data, you need to all

August 22, 2021By Simo Hadoop Hadoop, HDFS, overviewLeave a Comment

Arrays – Array intersects Hive

I have two string arrays in Hive

{‘value1′,’value2′,’value3’ }
{‘value1′,’value2’} I want to merge arrays without duplicates, the result:

{‘value1’, ‘value2′,’value3’} How can I do this

August 22, 2021By Simo Hadoop array, Arrays, hive, intersectionLeave a Comment

6-Hadoop operating mode (fully distributed) (on)

Note: In actual production and development, fully distributed is used

1) Prepare 3 clients (close firewall, static ip, host name)

2) Install JDK

3) Configure environment Variables

August 22, 2021By Simo Hadoop complete, distributed, Hadoop, mode, runLeave a Comment

Posts navigation

Page 1 … Page 8 Page 9 Page 10
Recent Posts
  • Sencha-Touch-2 – Sencha Touch 2, Nested XML Analysis NodeValue
  • Add a separation line and format XML content
  • Is there a norm of simplified XML subsets?
  • Look at it when you write React
  • ReactJS – Present React Redux React-Router App to add the server to the Firebase hosted by the Firebase
Categories
  • Android
  • Apache
  • Apache Kafka
  • Asp
  • Auto-Test
  • Automated Build
  • Aws
  • Bitcoin
  • Browser
  • C & C++
  • C#
  • Centos
  • Cgi
  • Character
  • Cloud Service
  • Cocos2dx
  • Cordova
  • CSS
  • Data Structure
  • Delphi
  • Design Pattern
  • Dojo
  • Dubbo
  • ELK
  • Flex
  • football
  • Game
  • Hadoop
  • Hibernate
  • HTML
  • Hybrid
  • Intel
  • IOS
  • Ipad
  • iPhone
  • Java
  • Javascript
  • Jetty
  • JQuery
  • Jsp
  • Linux
  • Load Balance
  • Lua
  • Macbook
  • Macos
  • Mathematics
  • Micro Services
  • Monitoring
  • Motherboard
  • Mysql
  • Network Hardware
  • Network Marketing
  • Nginx
  • NodeJs
  • Nosql
  • Oracle
  • Os Theory
  • Performance
  • PHP
  • Postgresql
  • Power Designer
  • React
  • Redis
  • Regexp
  • Rom
  • Rss
  • Ruby
  • Search Engines
  • Shell Script
  • Silicon Valley
  • Silverlight
  • Software Design
  • Spring
  • Sql
  • Sqlite
  • Sqlserver
  • Storage
  • Storm
  • Surface
  • SVN
  • Swift
  • System Architecture
  • Tablet
  • Uncategorized
  • Unix
  • Visual Basic
  • Visual Studio
  • Web Crawler
  • WebService
  • Windows
  • Wireless
  • XML
  • ZooKeeper
Archives
  • October 2021
  • September 2021
  • August 2021
  • May 2021
  • April 2021
  • September 2020
  • September 2019
  • August 2019
  • June 2019
  • May 2019
  • April 2019
  • March 2019
© Simon Technology Blog 2025 • ThemeCountry Powered by WordPress