Not Logged In

You could:

Log in
Register

research notes
  • Wikitips
  • Professional Alerts
  • Case Studies
  • How-to Notes
  • Community Questions
research meetings
  • Peer Incite Podcasts
  • Peer Incite Archive
Events
  • Enterprise Architect Summit 2008
    Oct 4-6, 2008
  • Peer Incite meeting - Topic: Best practice in tape backup and recovery
    Oct 7, 12:00-1:00 PM
  • Computerworld: Storage Networking World
    Oct 12-15, 2008
  • Usenix on the Road: Next Generation Storage Networking - 1/2 Day Lecture at the University of North Carolina
    Oct 16, 12:30-4:00 PM
  • Usenix on the Road: Next Generation Storage Networking - 1/2 Day Lecture at Virginia Tech
    Oct 21, 1:30-5:00 PM

Announcements
  • 10-07-08 Peer Incite: Best practice in tape backup and recovery
  • IBM's stealth XIV announcement
  • Welcome to Wikibon 2.0!
  • The IBM XIV Storage System Model A14
  • Storage Customers Seeing Green with Conserve IT
Home Profile Peers Wiki Groups Feedback


  • Article
  • Comments (0)
  • Page Protected
  • History
  • Vault
Tape deduplication standard is a win for users
  • Currently n/a/5 Stars.
  • 1
  • 2
  • 3
  • 4
  • 5
rate this
Last Update: Feb 16, 2008 | 08:27
Viewed 778 times | Community Rating: n/a
Originating Author: David Vellante



Originating Author: Dave Vellante

It would be nice to apply data deduplication at any application, but unfortunately it's not that simple. Performance considerations are important with this emerging technology. Writes are fairly clean, with any overhead of applying deduplication algorithms offset by the need to store less data, thereby reducing physical movement. But reads require more effort with the system reading the data (hopefully in cache), finding the hash, interpreting the hash and then reading the native data. As such, data deduplication should be primarily aimed at write-intensive applications.

In addition, data deduplication will perform best on serial data streams with no locking involved (e.g., database applications are unlikely to be good candidates). As well, users should target applications where lots of copies are being made over time and similar copies of data are being moved. Candidate applications include backup, archiving and even certain specific data-mining operations. For some larger applications certain files (e.g., history files) will be strong candidates.

Once these are determined, despite vendor implications to the contrary, users still should assume backing up at least some of the deduped data to tape. Today, the lack of deduplication technology in tape and the absence of standards means that data must be un-deduped to be backed up to tape. This adds additional overhead, complexity and elapsed time to backup and restore operations. A common deduplication standard across disk and tape would allow faster backups and allow deduped data to be restored to any disk that supports the standard, further simplifying operations and reducing reliance on proprietary vendor implementations.

Action Item: Users should push hard for both tape and disk vendors to develop data deduplication standards to facilitate simpler backup and restore operations. Organizations should be very careful about broadly committing to a single vendor solution without understanding the holistic implications on disk-tape-disk cloning, backup and restore processes.


To comment on this Professional Alert, please follow these simple instructions.

  • Login before editing or commenting on this page.
  • Comment on this article. You also can click the "+" tab above.
  • Please sign your comments by typing "~~~" at the end of your comment.

Community Comments

categories
Backup and restore, Storage disaster recovery, Storage professional alerts
Contributors

Dab4168

David Floyer

Bert Latamore

Comments (0)
Comments on 'Tape deduplication standard is a win for users'
There are currently no comments. Be the first!
Post A Comment

You must be logged in to post a comment, please Sign in

Revision ID Author Timestamp Comment
13959 Dab4168 08 Feb 16 20:27:34 Removed category Author dvellante
10173 Dvellante 07 Aug 28 15:46:08
8846 David Floyer 07 Jun 06 17:30:13 Refinement of analysis and advice
8840 Bert Latamore 07 Jun 06 13:22:20
8820 Dvellante 07 Jun 06 00:34:10
8819 Dvellante 07 Jun 06 00:32:34
8818 Dvellante 07 Jun 06 00:31:37
8817 Dvellante 07 Jun 06 00:23:39
8816 Dvellante 07 Jun 06 00:20:21
8815 Dvellante 07 Jun 06 00:01:39

Search:

news feed
  • Latest from Computerworld - Game economy grows with micropayments
  • eWeek - RSS Feeds - 5 Technology Businesses Poised to Boom in the Financial Crisis
  • InfoWorld RSS Feed - Microsoft lays out SQL Server roadmap
  • SearchStorage: News and trends in the storage industry - F5 Networks adds 10 GigE to ARX file virtualization product
  • Byte and Switch: - F5 Enhances File Virtualization Storage, Management
all »
blogs
  • Storagezilla - Sun batter NetApp in court
  • DrunkenData.com - Market Woes
  • StorageMojo - 3.5″ drives: the end is near
  • StorageRap - Mashup in blogland - will there be a future feeding franzy in 09?
  • Chuck's Blog - Virtual IT: A Frictionless World?
all »
companies
  • Compellent
  • Dell
  • Hitachi
  • EMC
  • EqualLogic
  • LeftHand Networks
all »
Want a Wikibon
Peer Incite
newsletter?

Email: Privacy by Safe Subscribe
Storage Spectrum
Order Storage Spectrum
By Fred Moore
US & Canada Only!
Browse best practices . publish tips . access project tools . collaborate with peers . get help on RFP's . use privacy settings to control who sees your info . join a group and share experiences with colleagues . review case studies . read professional alerts
  • Cloud Computing
    Clustered storage, Storage services, WEB2.0
  • Companies
    3PAR, Compellent, Dell, EMC, EqualLogic, HP, Hitachi, IBM, LSI, LeftHand Networks, NetApp, STEC inc, Sun, XIV
  • Data Protection
    Backup and restore, Business compliance, CDP, Data deduplication, Storage disaster recovery, Storage security
  • Energy Efficiency
    Data deduplication, Green storage, MAID, Thin provisioning, Tiered storage, VMware, Virtual tape
  • Planning Design Implementation Management
    Backup and restore, Business compliance, Data classification, Green storage, Managing storage, ROI, SRM, Storage Design, Storage asset management, Storage capacity management, Storage capacity planning, Storage implementation, Storage management, Storage operations, Storage planning, Storage vendor management, Tiered storage
  • Storage networks
    Clustered storage, ISCSI, NAS, SAN, SRM, Storage consolidation, Tiered storage, VMware
  • Virtualization
    Clustered storage, Green storage, Storage consolidation, Storage virtualization, Thin provisioning, VMware, Virtual tape
© Wikibon 2008 About Wikibon l Contacts l Terms of Service l Disclaimers l Privacy l Help