Important Notice: Our web hosting provider recently started charging us for additional visits, which was unexpected. In response, we're seeking donations. Depending on the situation, we may explore different monetization options for our Community and Expert Contributors. It's crucial to provide more returns for their expertise and offer more Expert Validated Answers or AI Validated Answers. Learn more about our hosting issue here.

Is there an advantage to parsing backup formats to deduplicate?

April 26, 2017advantage backup deduplicate formats parsing

0

Posted

Is there an advantage to parsing backup formats to deduplicate?

1 Answer

0

Posted

To be application independent and support the broad variety of Nearline applications, it is much more straightforward to work independently of application specific formats. Some vendors go against this trend and are content-dependent. This means they are locked into support of particular backup products and revisions; they parse those formats and create an internal file system, so that when a new file version comes in, they can compare it to its prior entry in its directory and store only the changes, not unlike a version control system for software development. This approach sounds promising – it could optimize compression tactics for particular data types, for example – but in practice it has more weaknesses than strengths. First, it is very capital intensive to develop. Second, it always involves some amount of reverse engineering, and sometimes the format originators are not supportive, so it will never be universal. Third, it makes it hard to find redundancy in other parts of the