• Skip to secondary menu
  • Skip to main content
  • Skip to primary sidebar

Python and R Tips

Learn Data Science with Python and R

  • Home
  • Python
  • Pandas
    • Pandas 101
  • tidyverse
    • tidyverse 101
  • R
  • Linux
  • Conferences
  • Python Books
  • About
    • Privacy Policy
You are here: Home / Python / gzip library in Python / How to Read a gzip File in Python?

How to Read a gzip File in Python?

February 8, 2018 by cmdlinetips

read and write gzip file in Python 3
uncompress gzip file in Python

gzip file format is one of the most common formats for compressing/decompressing files. gzip compression on text files greatly reduce the space used to store the text file. If you are working with a big data file, often the big text files is compressed with gzip or “gzipped” to save space. A naive way to work with compressed gzip file is to uncompress it and work with much bigger unzipped file line by line. Clearly, that is not the best solution.

In Python, you can directly work with gzip file. All you need is the Python library gzip.

import gzip

How to read a gzip file line by line in Python?

with gzip.open('big_file.txt.gz', 'rb') as f:
     for line in f:        
         print(line)  

How to Create a gzip File in Python

We can also use gzip library to create gzip (compressed) file by dumping the whole text content you have

all_of_of_your_content = "all the content of a big text file"
with gzip.open('file.txt.gz', 'wb') as f:
    f.write(all_of_your_content)

How to create gzip (compressed file) from an existing file?

We can create gzip file from plain txt file (unzipped) without reading line by line using shutil library. The shutil module offers high-level operations on files copying and deletion. We will first open the unzipped file, then open the zipped file and use shutil to copy the unzipped file object to zipped file object.

import shutil
# open the unzipped file with flie handler inp_f
with open("test_file.txt","rb") as inp_f:
     # open the output zipped file with file handler out_f
     with gzip.open("test_file.txt.gz","wb") as out_f:
         # use shutil to copy the file objec
         shutil.copyfileobj(inp_f,out_f)

Share this:

  • Click to share on Facebook (Opens in new window) Facebook
  • Click to share on X (Opens in new window) X

Related posts:

Default ThumbnailTen Linux Commands on “tar” and “gzip” You Need to Know Default ThumbnailHow to Save Pandas Dataframe as gzip/zip File? Default ThumbnailHow to read entire text file in Python? How to read a numerical data/file in Python with numpy?

Filed Under: gzip library in Python, read a gzip file in Python Tagged With: create gzip file in Python, gzip module in Python, read a gzip file in Python, shutil module

Primary Sidebar

Subscribe to Python and R Tips and Learn Data Science

Learn Pandas in Python and Tidyverse in R

Tags

Altair Basic NumPy Book Review Data Science Data Science Books Data Science Resources Data Science Roundup Data Visualization Dimensionality Reduction Dropbox Dropbox Free Space Dropbox Tips Emacs Emacs Tips ggplot2 Linux Commands Linux Tips Mac Os X Tips Maximum Likelihood Estimation in R MLE in R NumPy Pandas Pandas 101 Pandas Dataframe Pandas Data Frame pandas groupby() Pandas select columns Pandas select_dtypes Python Python 3 Python Boxplot Python Tips R rstats R Tips Seaborn Seaborn Boxplot Seaborn Catplot Shell Scripting Sparse Matrix in Python tidy evaluation tidyverse tidyverse 101 Vim Vim Tips

RSS RSS

  • How to convert row names to a column in Pandas
  • How to resize an image with PyTorch
  • Fashion-MNIST data from PyTorch
  • Pandas case_when() with multiple examples
  • An Introduction to Statistical Learning: with Applications in Python Is Here
  • 10 Tips to customize ggplot2 title text
  • 8 Plot types with Matplotlib in Python
  • PCA on S&P 500 Stock Return Data
  • Linear Regression with Matrix Decomposition Methods
  • Numpy’s random choice() function

Copyright © 2025 · Lifestyle Pro on Genesis Framework · WordPress · Log in

Go to mobile version