They don't use dedup on blob's. It's different from linear storage.
do have any reference?
I explicitly talked about block level and your example only makes sense with unencrypted data, wich I wasnât talking about. After that itâs plain science. Explain, how - in encrypted data! - bit patterns are more likely to occour in a block than other bit patterns, and youâre on with deduping. If you canât, youâre not, because addressing is the copy will be at least as costly (in terms of space needed) as the real data. Itâs just maths, arguing wonât help you there.
I donât know if i have corretly understand, but
If Rclone come back on Amazon, we are not possible to re-use encrypted files ???
Thanks for reply,
No. This is just the meta-discussion how costly rclone users are for Amazon. It (the costlyness, not the discussion) may have impact on if Amazon reinstates rclone, but if rclone came back on this discussion wouldnât have any further impact on you using rclone with Amazon.
Simple math and using the YouTube example above.
You have 15 Ebytes of data, stored on 4 mbytes blocks. You are telling me that mathematically is impossible to have the exact same block repeated?
When you encrypt you just scramble your data using an algorithm. You donât invent new characters or symbols. And there is a finite number of possibilities.
Commercially? EMC datadomain
I dint said that it has no impact! I said I is negligible compared to the bandwidth and CPU for the pull and gets that a rclone mount uses
cost per TB? per year?
Itâs a enterprise Datacenter product. Itâs not a nas or a service, although they might lease the equipment.
The ones i know that exist here were bought and not leased
ok, so do the math. what does a TB cost you for your storage?
example: i bought it for X, it has Y TB. so?
I didnât talk about bandwith and CPU. So itâs a straw man.
I only pointed out, that de-duplication doesnât work with encrypted data. You seem no longer to contest that. Great.
Here we manage around 15PB of storage and use over 22 PB and there is a lot of room to spare. And yes the data is all encrypted
answer the questionâŠ
You buy X PB of data and you get X PB of data. If you buy this equipment you can maximize your use to POSSIBLY Y PB of data. Is that so difficult do understand?
Try dedup with encrypted data and zfs and then get back to me.
my point was that storage is not free (even for google) and you cannot get âalmost infiniteâ storage with dedup alone. therefore storage costs money (even for google). right?
are you using compression with your zfs too?
Itâs peanuts compared to the other costs
itâs more than the average âpoweruserâ pays.