Apache HCatalog
===============

HCatalog is a table and storage management service for data created using Apache
Hadoop.

The vision of HCatalog is to provide table management and storage management layers
for Apache Hadoop. This includes:

* Providing a shared schema and data type mechanism.
* Providing a table abstraction so that users need not be concerned with where
  or how their data is stored.
* Providing interoperability across data processing tools such as Pig,
  MapReduce, Streaming, and Hive.

Data processors using Apache Hadoop have a common need for table management
services. The goal of a table management service is to track the data that
exists in a Hadoop grid and present it to users in a tabular format. HCatalog
provides a single input format and a single output format, so individual users
need not be concerned with the storage formats chosen for particular data sets.
All data is described by a schema and shares a common data type system.

Users are free to choose the best tools for their use cases. The Hadoop project
includes MapReduce, Streaming, Pig, and Hive, and additional tools such as
Cascading exist. Each of these tools has users who prefer it, and each addresses
some use cases best. With HCatalog, two users on the same grid who share data
are not constrained to use the same tool; each is free to choose the best tool
for their use case. HCatalog presents data in the same way to all of the tools,
providing interfaces to each of them.

For the latest information about HCatalog, please visit our website at:

   http://incubator.apache.org/hcatalog

and our wiki, at:

   https://cwiki.apache.org/confluence/display/HCATALOG