Table of Contents
Data De-duplication
Overview
CrashPlan's sophisticated data de-duplication is one of its most important features. Using advanced byte-pattern analysis, CrashPlan detects duplicate data and eliminates the need to back it up again. This means changed or copied files, documents and even embedded items in documents are backed up only once, reducing storage, network and performance costs. CrashPlan's data de-duplication operates on a per computer basis, which means that the files backed up from a single source computer are de-duplicated.
Data De-duplication Options
You can control the degree to which de-duplication is applied to your backup:
| Automatic | Full de-duplication is used when backing up over an Internet connection. Minimal is used when backing up directly to disk or over LAN. |
| Full | It is 100% effective, but is CPU-intensive. It is a little slower, but saves bandwidth and disk at destination. |
| Minimal | About 90% effective, it uses several methods to identify duplicate data. It is less CPU-intensive and when used to back up locally will speed up initial backup speed significantly, typically 400% on a single processor system. |
Benefits
- Backups consume less CPU, bandwidth and storage.
- Data de-duplication settings allow you to balance decreased CPU and speed vs saving bandwidth and disk at destinations.