PixCollect

PixCollect is a fast, multi-source image scraper designed to efficiently collect and save images for datasets and creative projects.

Features

Multi-threaded image scraping for fast and efficient downloads.
Supports sources sources like Google, and Pixabay.
Flexible configuration via appsettings.json file.
Saves images in various formats and integrates with your file system.

Installation

Clone the repository:

https://github.com/Isaac987/PixCollect.git

Navigate to the project directory:
```
cd PixCollect
```
Build the project:
```
dotnet build
```

Usage

Start a Scraping Session

Run a scraping session with a specific query and limit the number of images:

dotnet run scrape run <query> <limit>

query: The keyword or search term for the images.
limit: Maximum number of images to scrape.

Manage Image Sources

Enable or disable specific image sources for the session (currently supports: google, pixabay):

# Enable a source
dotnet run scrape enable-source <source>

# Disable a source
dotnet run scrape disable-source <source>

source: The name of the image source to enable or disable.

Configure Scrape Settings

View and modify default scrape settings:

# List current settings
dotnet run scrape list-settings

# Update a default setting
dotnet run scrape set-output-directory <directory-path>   # Change the output directory
dotnet run scrape set-format <image-format>              # Set the default image format
dotnet run scrape set-headless <true|false>              # Enable or disable headless mode

directory-path: Specifies the directory where scraped images will be saved. For example: /path/to/output..
image-format: The desired format for images (e.g., jpg, png). Must be a valid format.
true|false: Use true to enable headless mode or false to disable it.

Name		Name	Last commit message	Last commit date
Latest commit History 80 Commits
.idea/.idea.PixCollect/.idea		.idea/.idea.PixCollect/.idea
Assets		Assets
CLI		CLI
Configuration		Configuration
Scraping		Scraping
Uploading		Uploading
.gitignore		.gitignore
PixCollect.csproj		PixCollect.csproj
PixCollect.sln		PixCollect.sln
Program.cs		Program.cs
README.md		README.md
appsettings.json		appsettings.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PixCollect

Features

Installation

Usage

Start a Scraping Session

Manage Image Sources

Configure Scrape Settings

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

PixCollect

Features

Installation

Usage

Start a Scraping Session

Manage Image Sources

Configure Scrape Settings

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages