# SharpCompress Usage ## Async/Await Support (Beta) SharpCompress now provides full async/await support for all I/O operations. All `Read`, `Write`, and extraction operations have async equivalents ending in `Async` that accept an optional `CancellationToken`. This enables better performance and scalability for I/O-bound operations. **Key Async Methods:** - `reader.WriteEntryToAsync(stream, cancellationToken)` - Extract entry asynchronously - `reader.WriteAllToDirectoryAsync(path, cancellationToken)` - Extract all asynchronously - `writer.WriteAsync(filename, stream, modTime, cancellationToken)` - Write entry asynchronously - `writer.WriteAllAsync(directory, pattern, searchOption, cancellationToken)` - Write directory asynchronously - `entry.OpenEntryStreamAsync(cancellationToken)` - Open entry stream asynchronously See [Async Examples](#async-examples) section below for usage patterns. ## Stream Rules When dealing with Streams, the rule should be that you don't close a stream you didn't create. This, in effect, should mean you should always put a Stream in a using block to dispose it. However, the .NET Framework often has classes that will dispose streams by default to make things "easy" like the following: ```C# using (var reader = new StreamReader(File.Open("foo"))) { ... } ``` In this example, reader should get disposed. However, stream rules should say the the `FileStream` created by `File.Open` should remain open. However, the .NET Framework closes it for you by default unless you override the constructor. In general, you should be writing Stream code like this: ```C# using (var fileStream = File.Open("foo")) using (var reader = new StreamReader(fileStream)) { ... } ``` To deal with the "correct" rules as well as the expectations of users, I've decided to always close wrapped streams as of 0.21. To be explicit though, consider always using the overloads that use `ReaderOptions` or `WriterOptions` and explicitly set `LeaveStreamOpen` the way you want. Default behavior in factory APIs: - File path / `FileInfo` overloads set `LeaveStreamOpen = false`. - Caller-provided `Stream` overloads set `LeaveStreamOpen = true`. If using Compression Stream classes directly and you don't want the wrapped stream to be closed. Use the `NonDisposingStream` as a wrapper to prevent the stream being disposed. The change in 0.21 simplified a lot even though the usage is a bit more convoluted. ## Samples Also, look over the tests for more thorough [examples](https://github.com/adamhathcock/sharpcompress/tree/master/tests/SharpCompress.Test) ### Create Zip Archive from multiple files ```C# using(var archive = ZipArchive.CreateArchive()) { archive.AddEntry("file01.txt", "C:\\file01.txt"); archive.AddEntry("file02.txt", "C:\\file02.txt"); ... archive.SaveTo("C:\\temp.zip", CompressionType.Deflate); } ``` ### Create Zip Archive from all files in a directory to a file ```C# using (var archive = ZipArchive.CreateArchive()) { archive.AddAllFromDirectory("D:\\temp"); archive.SaveTo("C:\\temp.zip", CompressionType.Deflate); } ``` ### Create Zip Archive from all files in a directory and save in memory ```C# var memoryStream = new MemoryStream(); using (var archive = ZipArchive.CreateArchive()) { archive.AddAllFromDirectory("D:\\temp"); archive.SaveTo(memoryStream, new WriterOptions(CompressionType.Deflate) { LeaveStreamOpen = true }); } //reset memoryStream to be usable now memoryStream.Position = 0; ``` ### Extract all files from a rar file to a directory using RarArchive Note: Extracting a solid rar or 7z file needs to be done in sequential order to get acceptable decompression speed. `ExtractAllEntries` is primarily intended for solid archives (like solid Rar) or 7Zip archives, where sequential extraction provides the best performance. For general/simple extraction with any supported archive type, use `archive.WriteToDirectory()` instead. ```C# // Using fluent factory method for extraction options using (var archive = RarArchive.OpenArchive("Test.rar", ReaderOptions.ForOwnedFile() .WithExtractFullPath(true) .WithOverwrite(true))) { // Simple extraction with RarArchive; this WriteToDirectory pattern works for all archive types archive.WriteToDirectory(@"D:\temp"); } ``` ### Iterate over all files from a Rar file using RarArchive ```C# using (var archive = RarArchive.OpenArchive("Test.rar")) { foreach (var entry in archive.Entries.Where(entry => !entry.IsDirectory)) { Console.WriteLine($"{entry.Key}: {entry.Size} bytes"); } } ``` ### Extract solid Rar or 7Zip archives with progress reporting `ExtractAllEntries` only works for solid archives (Rar) or 7Zip archives. For optimal performance with these archive types, use this method: ```C# using SharpCompress.Common; using SharpCompress.Readers; var progress = new Progress(report => { Console.WriteLine($"Extracting {report.EntryPath}: {report.PercentComplete}%"); }); using (var archive = RarArchive.OpenArchive("archive.rar", ReaderOptions.ForOwnedFile() .WithProgress(progress) .WithExtractFullPath(true) .WithOverwrite(true))) // Must be solid Rar or 7Zip { archive.WriteToDirectory(@"D:\output"); } ``` ### Use ReaderFactory to autodetect archive type and Open the entry stream ```C# using (Stream stream = File.OpenRead("Tar.tar.bz2")) using (var reader = ReaderFactory.OpenReader(stream)) { while (reader.MoveToNextEntry()) { if (!reader.Entry.IsDirectory) { Console.WriteLine(reader.Entry.Key); reader.WriteEntryToDirectory(@"C:\temp"); } } } ``` ### Use ReaderFactory to autodetect archive type and Open the entry stream ```C# using (Stream stream = File.OpenRead("Tar.tar.bz2")) using (var reader = ReaderFactory.OpenReader(stream)) { while (reader.MoveToNextEntry()) { if (!reader.Entry.IsDirectory) { using (var entryStream = reader.OpenEntryStream()) { entryStream.CopyTo(...); } } } } ``` ### Use WriterFactory to write all files from a directory in a streaming manner. ```C# using (Stream stream = File.OpenWrite("C:\\temp.tgz")) using (var writer = WriterFactory.OpenWriter( stream, ArchiveType.Tar, WriterOptions.ForTar(CompressionType.GZip).WithLeaveStreamOpen(true))) { writer.WriteAll("D:\\temp", "*", SearchOption.AllDirectories); } ``` ### Extract zip which has non-utf8 encoded filename(cp932) ```C# var encoding = Encoding.GetEncoding(932); var opts = new ReaderOptions() .WithArchiveEncoding(new ArchiveEncoding { CustomDecoder = (data, x, y) => encoding.GetString(data) }); using var archive = ZipArchive.OpenArchive("test.zip", opts); foreach(var entry in archive.Entries) { Console.WriteLine($"{entry.Key}"); } ``` ## Custom Compression Providers By default `ReaderOptions` and `WriterOptions` already include `CompressionProviderRegistry.Default` via their `Providers` property, so you can read and write without touching the registry yet still get SharpCompress’s built-in implementations. The configured registry is used consistently across Reader APIs, Writer APIs, Archive APIs, and async entry-stream extraction, including compressed TAR wrappers and ZIP async decompression. To replace a specific algorithm (for example to use `System.IO.Compression` for GZip or Deflate), create a modified registry and pass it through the same options: ```C# var systemGZip = new SystemGZipCompressionProvider(); var customRegistry = CompressionProviderRegistry.Default.With(systemGZip); var readerOptions = ReaderOptions.ForOwnedFile() .WithProviders(customRegistry); using var reader = ReaderFactory.OpenReader(stream, readerOptions); var writerOptions = new WriterOptions(CompressionType.GZip) .WithProviders(customRegistry); using var writer = WriterFactory.OpenWriter(outputStream, ArchiveType.GZip, writerOptions); ``` The registry also exposes `GetCompressingProvider` (now returning `ICompressionProviderHooks`) when a compression format needs pre- or post-stream data (e.g., LZMA/PPMd). Implementations that need extra headers can supply those bytes through the `ICompressionProviderHooks` members while the rest of the API still works through the `Providers` property. ## Async Examples ### Async Reader Examples **Extract single entry asynchronously:** ```C# using (Stream stream = File.OpenRead("archive.zip")) using (var reader = ReaderFactory.OpenReader(stream)) { while (reader.MoveToNextEntry()) { if (!reader.Entry.IsDirectory) { using (var entryStream = reader.OpenEntryStream()) { using (var outputStream = File.Create("output.bin")) { await reader.WriteEntryToAsync(outputStream, cancellationToken); } } } } } ``` **Extract all entries asynchronously:** ```C# using (Stream stream = File.OpenRead("archive.tar.gz")) using (var reader = ReaderFactory.OpenReader(stream)) { await reader.WriteAllToDirectoryAsync( @"D:\temp", cancellationToken ); } ``` **Open and process entry stream asynchronously:** ```C# using (var archive = ZipArchive.OpenArchive("archive.zip")) { foreach (var entry in archive.Entries.Where(e => !e.IsDirectory)) { using (var entryStream = await entry.OpenEntryStreamAsync(cancellationToken)) { // Process the decompressed stream asynchronously await ProcessStreamAsync(entryStream, cancellationToken); } } } ``` ### Async Writer Examples **Write single file asynchronously:** ```C# using (Stream archiveStream = File.OpenWrite("output.zip")) using (var writer = WriterFactory.OpenWriter(archiveStream, ArchiveType.Zip, CompressionType.Deflate)) { using (Stream fileStream = File.OpenRead("input.txt")) { await writer.WriteAsync("entry.txt", fileStream, DateTime.Now, cancellationToken); } } ``` **Write entire directory asynchronously:** ```C# using (Stream stream = File.OpenWrite("backup.tar.gz")) using (var writer = WriterFactory.OpenWriter(stream, ArchiveType.Tar, new WriterOptions(CompressionType.GZip))) { await writer.WriteAllAsync( @"D:\files", "*", SearchOption.AllDirectories, cancellationToken ); } ``` **Write with progress tracking and cancellation:** ```C# var cts = new CancellationTokenSource(); // Set timeout or cancel from UI cts.CancelAfter(TimeSpan.FromMinutes(5)); using (Stream stream = File.OpenWrite("archive.zip")) using (var writer = WriterFactory.OpenWriter(stream, ArchiveType.Zip, CompressionType.Deflate)) { try { await writer.WriteAllAsync(@"D:\data", "*", SearchOption.AllDirectories, cts.Token); } catch (OperationCanceledException) { Console.WriteLine("Operation was cancelled"); } } ``` ### Archive Async Examples **Extract from archive asynchronously:** ```C# using (var archive = ZipArchive.OpenArchive("archive.zip")) { // Simple async extraction - works for all archive types await archive.WriteToDirectoryAsync( @"C:\output", cancellationToken ); } ``` **Benefits of Async Operations:** - Non-blocking I/O for better application responsiveness - Improved scalability for server applications - Support for cancellation via CancellationToken - Better resource utilization in async/await contexts - Compatible with modern .NET async patterns