Talk:Development ideas: Difference between revisions

From ReddNet
Jump to navigation Jump to search
No edit summary
Line 1: Line 1:
==Data Integrity==
==Data Integrity==


*The idea of 64k chunks per allocation for checksums is what's termed a hash list
*The idea of 64k chunks for checksums (subdividing an IBP allocation) is what's termed a hash list
**http://en.wikipedia.org/wiki/Hash_list
**http://en.wikipedia.org/wiki/Hash_list
*which can be extended to a multi-level hash tree
*which can be extended to a multi-level hash tree

Revision as of 10:54, 28 January 2008

Data Integrity

  • One could imagine a 3-level tree
    • a hash for each lowest-common-unit, say the commonly used 1K
    • a single top hash for each IBP allocation
    • a single top hash for the file
    • or more levels

If the IBP protocol were to be extended to support a single specific checksum method it would possibly be something like TTH. In this example, this would mean TTH usage becomes part of the IBP protocol specification and both the IBP client and IBP depot would be required to implement it. The next level tools and APIs like LoRS and libxio and lstcp would not be required to know anything about IBP internal checksums, but they could optionally use it.

  • If checksumming were optional, additional IBP commands to turn it on and off would be needed, and interoperability with non-checksumming depots becomes and issue.
  • Do the hashes go into the exnode, or are they only stored in the depot allocation tables?

- Dan 1/24/08