Apache Ozone
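The idea was first raised in the HDFS community in the second half of 2014, and the project formally started in 2015. Because Ozone was evolving much faster than the HDFS release cadence, it was formally split out of the Hadoop community in 2020 and became a standalone project, today's Apache Ozone. The project's core goals are to address the scalability limits of the HDFS NameNode and the storage bottleneck caused by small files, with the aim of supporting more than 10 billion files.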
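- Zhao Mingchao (Tencent): Apache Ozone, next-generation big data storage demystified (腾讯赵明超:Apache Ozone下一代大数据存储解密)
- Replacing HDFS? The latest research progress on Ozone at Tencent (取代 HDFS?Ozone 在腾讯的最新研究进展)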
- https://arch-long.cn/articles/hadoop/Ozone.html
install
# wget https://archive.apache.org/dist/ozone/2.0.0/ozone-2.0.0.tar.gz
wget https://mirrors.ustc.edu.cn/apache/ozone/2.0.0/ozone-2.0.0.tar.gz
mkdir -p ozone
tar -xvzf ozone-2.0.0.tar.gz --strip-components 1 -C ./ozone/
cd ozone
./bin/ozone --help
Usage: ozone [OPTIONS] SUBCOMMAND [SUBCOMMAND OPTIONS]

OPTIONS is none or any of:

--buildpaths                      attempt to add class files from build tree
--config dir                      Ozone config directory
--daemon (start|status|stop)      operate on a daemon
--debug                           turn on shell script debug mode
--help                            usage information
--hostnames list[,of,host,names]  hosts to use in worker mode
--hosts filename                  list of hosts to use in worker mode
--jvmargs arguments               append JVM options to any existing options defined in the OZONE_OPTS environment variable. Any defined in OZONE_CLIENT_OPTS will be append after these jvmargs
--loglevel level                  set the log4j level for this command
--validate (continue)             validates if all jars as indicated in the corresponding OZONE_RUN_ARTIFACT_NAME classpath file are present, command execution shall continue post validation failure if 'continue' is passed
--workers                         turn on worker mode

SUBCOMMAND is one of:

Admin Commands:

daemonlog    get/set the log level for each daemon

Client Commands:

admin        Ozone admin tool
checknative  checks if native libraries are loaded
classpath    prints the class path needed for running ozone commands
debug        Ozone debug tool
dtutil       operations related to delegation tokens
envvars      display computed Hadoop environment variables
freon        runs an ozone data generator
fs           run a filesystem command on Ozone file system. Equivalent to 'hadoop fs'
genconf      generate minimally required ozone configs and output to ozone-site.xml in specified path
getconf      get ozone config values from configuration
insight      tool to get runtime operation information
ratis        Ozone ratis tool
repair       Ozone repair tool
s3           command line interface for s3 related operations
sh           command line interface for object store operations
tenant       command line interface for multi-tenant related operations
version      print the version

Daemon Commands:

csi          run the standalone CSI daemon
datanode     run a HDDS datanode
httpfs       run the HTTPFS compatible REST gateway
om           Ozone Manager
recon        run the Recon service
s3g          run the S3 compatible REST gateway
scm          run the Storage Container Manager service

SUBCOMMAND may print help when invoked w/o parameters or with -h.
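
The daemon commands listed above can be combined with `--daemon start` to bring up a small single-node cluster. The following is only a rough sketch under several assumptions: the current directory is the extracted `ozone` directory, a usable `ozone-site.xml` is already configured, and the start order (SCM, then OM, then datanode) follows the usual on-premise setup. The `--init` flag does not appear in the help text above and is assumed from the Ozone installation documentation. `run_ozone`, `start_single_node`, and `DRY_RUN` are hypothetical names introduced here; with `DRY_RUN=1` (the default) the commands are only printed, not executed.

```shell
#!/usr/bin/env bash
# Sketch: start a single-node Ozone cluster from the unpacked tarball.
# Assumptions (not from the notes above): OZONE_HOME is the extracted
# directory, ozone-site.xml is already configured, and DRY_RUN=1 only
# prints each command instead of running it.

OZONE_HOME="${OZONE_HOME:-.}"
DRY_RUN="${DRY_RUN:-1}"

# Hypothetical helper: print the ozone command in dry-run mode,
# otherwise execute it with the given arguments.
run_ozone() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "$OZONE_HOME/bin/ozone $*"
  else
    "$OZONE_HOME/bin/ozone" "$@"
  fi
}

# Hypothetical helper: initialize and start SCM, then OM, then one datanode.
# Stops at the first command that fails.
start_single_node() {
  run_ozone scm --init || return 1
  run_ozone --daemon start scm || return 1
  run_ozone om --init || return 1
  run_ozone --daemon start om || return 1
  run_ozone --daemon start datanode || return 1
}
```

Calling `start_single_node` with the defaults prints the five commands for review; setting `DRY_RUN=0` before calling it would actually run them, which should only be done once the configuration has been checked.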