Locations - Spacedrive

Locations are directories that Spacedrive tracks and monitors. When you add a location, Spacedrive indexes its contents and watches for changes in real-time.

What is a Location?

A location is any folder on your device that you want Spacedrive to index. Once added:

Files are indexed immediately
Changes are detected in real-time
Metadata syncs across your devices (if enabled)
Content gets unique identifiers for deduplication

Location Architecture

Locations have a sophisticated relationship with the file index:

Root Entry

When you add a location, Spacedrive creates a root entry in the entries table for the location directory itself. This root entry:

Has no parent (parent_id = NULL)
Gets a self-reference in the entry_closure table
Serves as the ancestor for all files/folders within the location
Is stored in the directory_paths table with its full filesystem path

// Location points to root entry
Location {
    entry_id: Some(123),  // References entries table
    // ... other fields
}

// Root entry for "/Users/alice/Documents"
Entry {
    id: 123,
    name: "Documents",
    kind: Directory,
    parent_id: None,  // Root has no parent
    // ... other fields
}

Why Root Entries?

This design enables nested locations without duplicating the file tree. If you have:

Location A: /Users/alice/Documents (entry_id: 123)
Location B: /Users/alice/Documents/Work (entry_id: 124)

The entries table contains a single unified tree:

Entry(123): "Documents"           parent_id: NULL
  ├─ Entry(124): "Work"            parent_id: 123  ← Location B points here
  │   └─ Entry(125): "report.pdf"  parent_id: 124
  └─ Entry(126): "Photos"          parent_id: 123
      └─ Entry(127): "sunset.jpg"  parent_id: 126

Benefits:

No duplication - Work/report.pdf exists once in the database, accessible from both locations
Consistent hierarchy - Parent-child relationships are preserved regardless of location boundaries
Efficient storage - Adding nested locations doesn’t create redundant entry records
Flexible organization - Users can create fine-grained locations within broader ones
Different index modes - Apply Deep indexing to /Photos but Shallow to /Photos/RAW without duplicating entries

Without this design, overlapping locations would require duplicating all entries, leading to sync conflicts and wasted storage.

Entry Hierarchy

All files and folders within a location form a tree structure:

Location (entry_id: 123)
  └─ Entry(123): "Documents"           parent_id: NULL
      ├─ Entry(124): "Work"            parent_id: 123
      │   └─ Entry(125): "report.pdf"  parent_id: 124
      └─ Entry(126): "Photos"          parent_id: 123
          └─ Entry(127): "sunset.jpg"  parent_id: 126

The entry_closure table enables efficient queries:

“Find all files in this location” → Query where ancestor_id = 123
“Get file path” → Walk up closure table following parent relationships
“Calculate total size” → Aggregate from all descendants

Directory Paths Table

Directory paths are stored separately for performance:

-- directory_paths table
entry_id | path
---------|---------------------------
123      | /Users/alice/Documents
124      | /Users/alice/Documents/Work
126      | /Users/alice/Documents/Photos

This avoids storing redundant path information on every file entry and makes path-based lookups significantly faster. Only directories are stored in this table, not individual files.

Device Ownership & Sync

Entries don’t have a device_id field. Instead, ownership is inherited from the location:

// Location owns all its entries
Location {
    device_id: 42,    // Device that owns this location
    entry_id: 123,    // Root of the entry tree
}

// Entries don't store device_id
Entry {
    id: 124,
    parent_id: 123,   // Part of location's tree
    // NO device_id field
}

Why this matters for sync:

Efficient ownership queries - Find all entries owned by a device via entry_closure:

SELECT e.* FROM entries e
INNER JOIN entry_closure ec ON e.id = ec.descendant_id
WHERE ec.ancestor_id IN (
    SELECT entry_id FROM locations WHERE device_id = ?
);

No redundant storage - Avoids storing device_id on millions of entry records
Instant ownership transfer - When you move an external drive between devices, just update the location’s device_id. All millions of files instantly change ownership without touching the entries table. See Library Sync - Portable Volumes for details.

Sync implications:

Only the owning device can modify entries in a location
Other devices have read-only views of remote locations
Location ownership change is a single-row update (not a bulk migration)

Core Properties

pub struct Location {
    pub id: i32,                     // Database primary key
    pub uuid: Uuid,                  // Unique identifier
    pub device_id: i32,              // Device that owns this location
    pub entry_id: Option<i32>,       // Root entry for this location's tree
    pub name: Option<String>,        // Display name
    pub index_mode: String,          // "shallow" | "content" | "deep"
    pub scan_state: String,          // "pending" | "scanning" | "completed" | "error"
    pub last_scan_at: Option<DateTime<Utc>>,
    pub error_message: Option<String>,
    pub total_file_count: i64,
    pub total_byte_size: i64,
    pub job_policies: Option<String>, // JSON-serialized policies
    pub created_at: DateTime<Utc>,
    pub updated_at: DateTime<Utc>,
}

See Data Model - Location for the complete database schema.

Index Modes

Choose how deeply Spacedrive analyzes your files:

Shallow: Basic metadata only (name, size, dates)
Content: Includes content hashing for deduplication
Deep: Full media processing with thumbnails and metadata

Scan States

Locations track their scanning status:

pub enum ScanState {
    Idle,                    // Not scanning
    Scanning { progress: u8 }, // Currently scanning (0-100%)
    Completed,               // Scan finished successfully
    Failed,                  // Scan encountered errors
    Paused,                  // Scan was paused
}

Adding Locations

Using the CLI

# Add a location
spacedrive location add ~/Documents --name "Documents"

# Add with deep indexing
spacedrive location add ~/Photos --name "Photos" --mode deep

# List all locations
spacedrive location list

Using the API

let (location_id, job_id) = location_manager
    .add_location(
        library,
        PathBuf::from("/Users/alice/Desktop"),
        Some("Desktop".to_string()),
        device_id,
        IndexMode::Content,
    )
    .await?;

What Happens When You Add a Location

The location creation process is atomic (uses database transactions):

Path validation - Ensures the directory exists and is accessible
Create root entry - Creates an entry for the location directory
- Assigns a UUID for sync compatibility
- Sets parent_id = NULL (this is a root)
- Sets indexed_at to enable sync emission
Create closure record - Adds self-reference to entry_closure table
Create directory path - Adds full path to directory_paths table
Duplicate check - Verifies no location already exists for this entry
Create location record - Stores location metadata
- Links to root entry via entry_id
- Sets initial scan_state = "pending"
Commit transaction - All or nothing (prevents partial state)
Emit sync event - Broadcasts root entry StateChange for sync
Start indexing job - Begins scanning child files/folders
Watcher setup - Begins monitoring changes
Event broadcast - Notifies other services

If any step fails, the entire transaction rolls back.

File System Watching

The watcher service provides real-time monitoring across all platforms.

Platform Support

macOS: Uses FSEvents for efficient volume-level monitoring Linux: Uses inotify for precise file-level events Windows: Uses ReadDirectoryChangesW for real-time updates

Event Types

The watcher detects these file system changes:

pub enum WatcherEvent {
    Create,     // New file or directory
    Modify,     // Content or metadata changed
    Remove,     // File or directory deleted
    Rename {    // Move or rename operation
        from: PathBuf,
        to: PathBuf,
    },
}

Automatic Filtering

The watcher respects the indexer rules configured for each location. These rules determine which files and directories are tracked: Available Indexer Rules:

no_system_files - Ignores .DS_Store, Thumbs.db, and other OS-generated files
no_hidden - Ignores hidden files (dotfiles) except important ones like .gitignore
no_git - Ignores .git directories
gitignore - Respects .gitignore patterns in the directory tree
only_images - Only processes image files
no_dev_dirs - Ignores node_modules/, target/, .cache/, and other build directories

Default Configuration:

RuleToggles {
    no_system_files: true,   // Skip OS junk
    no_hidden: false,        // Include dotfiles
    no_git: true,            // Skip .git
    gitignore: true,         // Respect .gitignore
    only_images: false,      // Index all file types
    no_dev_dirs: true,       // Skip build artifacts
}

Currently, the watcher uses hardcoded filtering instead of per-location indexer rules. Integration with the indexer rules engine is planned, which will allow each location to have custom filtering rules configured through the UI.

When indexer rules integration is complete, the watcher will automatically filter events based on each location’s configured rules, ensuring consistency between initial indexing and real-time change detection.

Configuration

Watcher Settings

Configure the watcher for your needs:

LocationWatcherConfig {
    debounce_duration: Duration::from_millis(100),  // Event consolidation
    event_buffer_size: 1000,                        // Queue size
    debug_mode: false,                              // Detailed logging
}

Performance Tuning

For large directories (>100k files):

Increase buffer size to prevent event loss
Use longer debounce periods
Consider excluding cache directories

For network drives:

Enable polling fallback
Increase debounce duration
Monitor connection stability

For SSDs vs HDDs:

SSDs: Shorter debounce, larger buffers
HDDs: Longer debounce for mechanical latency

Watching Events

Subscribe to location events in your code:

let mut events = event_bus.subscribe();

while let Ok(event) = events.recv().await {
    match event {
        Event::EntryCreated { path, .. } => {
            println!("New file: {}", path);
        }
        Event::EntryModified { path, .. } => {
            println!("File changed: {}", path);
        }
        _ => {}
    }
}

Event Flow

When a file changes:

Operating system detects change
Watcher receives raw event
Event is filtered and debounced
Structured event is created
Event bus broadcasts to subscribers
Services react (indexer, sync, UI)

Managing Locations

Pause Watching

Temporarily stop monitoring without removing:

spacedrive location pause <location-id>

Update Settings

Change location configuration:

# Disable watching
spacedrive location update <location-id> --watch false

# Change index mode
spacedrive location update <location-id> --mode shallow

Remove Location

Stop tracking a directory:

spacedrive location remove <location-id>

Removing a location doesn’t delete files. It only stops Spacedrive from tracking them.

Troubleshooting

High CPU Usage

If watching causes high CPU:

Check for directories with rapid changes
Increase debounce duration
Exclude problematic subdirectories
Temporarily disable watching

# Find active locations
spacedrive location list --verbose

# Disable problematic location
spacedrive location update <id> --watch false

Missing Changes

If file changes aren’t detected:

Verify location has watching enabled
Check file system permissions
Ensure platform limits aren’t exceeded
Try restarting the watcher

Duplicate Events

If you see the same change multiple times:

Increase debounce duration
Check for symlink loops
Verify you’re not watching overlapping paths

Debug Mode

Enable detailed logging:

# Set debug mode
export SPACEDRIVE_WATCHER_DEBUG=1

# Run with verbose logging
spacedrive --log-level debug

Platform Limits

macOS

FSEvents may coalesce rapid changes
Volume-level monitoring affects all locations on a drive

Linux

inotify has a watch descriptor limit (usually 8192)
Increase with: echo 524288 | sudo tee /proc/sys/fs/inotify/max_user_watches

Windows

Long paths require special handling
Network drives may fall back to polling

Best Practices

Location Organization

Nested locations are supported - You can add both /Documents and /Documents/Work as separate locations
- The file tree is shared, not duplicated
- Each location provides a different “view” into the same hierarchy
- Useful for applying different index modes or policies to subdirectories
Group related content - One location per project or media type
Consider performance - Separate frequently-changing directories

Exclusion Patterns

Create .spacedriveignore files to exclude:

# Build artifacts
node_modules/
target/
dist/

# Cache directories
.cache/
*.tmp

# Large generated files
*.log
*.dump

Network Locations

For network-attached storage:

Use Content index mode (avoid Deep)
Increase debounce to 500ms+
Monitor connection stability
Consider scheduled indexing instead

Integration

Services Using Location Events

Indexer: Re-analyzes modified files
Search: Updates search index
Sync: Propagates changes to other devices
Thumbnails: Regenerates previews
Frontend: Updates UI in real-time

Event Priority

Critical events are processed first:

User-initiated changes
Create/delete operations
Modifications
Metadata updates

Implementation Details

Database Schema

Locations utilize three main tables: locations - Location metadata

CREATE TABLE locations (
    id INTEGER PRIMARY KEY,
    uuid BLOB NOT NULL UNIQUE,
    device_id INTEGER NOT NULL,
    entry_id INTEGER,  -- References entries(id)
    name TEXT,
    index_mode TEXT NOT NULL,
    scan_state TEXT NOT NULL,
    -- ... other fields
    FOREIGN KEY (device_id) REFERENCES devices(id),
    FOREIGN KEY (entry_id) REFERENCES entries(id)
);

entries - All files and directories (see Data Model) entry_closure - Transitive closure table for hierarchy (see Data Model) directory_paths - Directory path cache

CREATE TABLE directory_paths (
    entry_id INTEGER PRIMARY KEY,  -- References entries(id)
    path TEXT NOT NULL,
    FOREIGN KEY (entry_id) REFERENCES entries(id)
);

Query Patterns

Find all entries in a location:

SELECT e.* FROM entries e
INNER JOIN entry_closure ec ON e.id = ec.descendant_id
WHERE ec.ancestor_id = (
    SELECT entry_id FROM locations WHERE uuid = ?
);

Get full path for a directory:

SELECT path FROM directory_paths
WHERE entry_id = ?;

Get full path for a file:

-- Walk up parent chain and concatenate with directory path
SELECT dp.path || '/' || e.name as full_path
FROM entries e
LEFT JOIN entries parent ON e.parent_id = parent.id
LEFT JOIN directory_paths dp ON parent.id = dp.entry_id
WHERE e.id = ?;

Data Model - Complete database schema
Indexing - How files are analyzed
Sync - Cross-device synchronization
Events - Event system architecture

Architecture

File Management

Sync & Network

Development

​What is a Location?

​Location Architecture

​Root Entry

​Why Root Entries?

​Entry Hierarchy

​Directory Paths Table

​Device Ownership & Sync

​Core Properties

​Index Modes

​Scan States

​Adding Locations

​Using the CLI

​Using the API

​What Happens When You Add a Location

​File System Watching

​Platform Support

​Event Types

​Automatic Filtering

​Configuration

​Watcher Settings

​Performance Tuning

​Watching Events

​Event Flow

​Managing Locations

​Pause Watching

​Update Settings

​Remove Location

​Troubleshooting

​High CPU Usage

​Missing Changes

​Duplicate Events

​Debug Mode

​Platform Limits

​macOS

​Linux

​Windows

​Best Practices

​Location Organization

​Exclusion Patterns

​Network Locations

​Integration

​Services Using Location Events

​Event Priority

​Implementation Details

​Database Schema

​Query Patterns

​Related Documentation

What is a Location?

Location Architecture

Root Entry

Why Root Entries?

Entry Hierarchy

Directory Paths Table

Device Ownership & Sync

Core Properties

Index Modes

Scan States

Adding Locations

Using the CLI

Using the API

What Happens When You Add a Location

File System Watching

Platform Support

Event Types

Automatic Filtering

Configuration

Watcher Settings

Performance Tuning

Watching Events

Event Flow

Managing Locations

Pause Watching

Update Settings

Remove Location

Troubleshooting

High CPU Usage

Missing Changes

Duplicate Events

Debug Mode

Platform Limits

macOS

Linux

Windows

Best Practices

Location Organization

Exclusion Patterns

Network Locations

Integration

Services Using Location Events

Event Priority

Implementation Details

Database Schema

Query Patterns

Related Documentation