PathResolver: The Simple Explanation

What Problem Does It Solve?

Imagine you’re giving someone directions to a room in a building:

“Go to the office, then into the storage closet, then back out, then into the conference room.”

That’s a lot of steps! A smart person would simplify it:

“Just go to the conference room.”

PathResolver does exactly this for file paths.

The Problem: Messy Paths

When programs work with files, they often create messy paths like:

/home/user/../user/./documents/../documents/report.txt

This path says:

Go to /home/user
Go back up (..)
Go to user again
Stay here (.)
Go to documents
Go back up (..)
Go to documents again
Finally, report.txt

That’s exhausting! The simple answer is just:

/home/user/documents/report.txt

PathResolver’s job is to figure out the simple answer.

Why Can’t the Backend Just Do This?

Good question! Here’s why we separated it:

1. Different Backends, Same Logic

Think of backends like different types of filing cabinets:

MemoryBackend = Files in your brain (RAM)
anyfs-sqlite: SqliteBackend = Files in a database (ecosystem crate)
VRootFsBackend = Files on your hard drive

The path simplification logic is the same for all of them:

.. means “go up one level”
. means “stay here”
Symlinks mean “actually go over there instead”

Why write this logic three times? Write it once, use it everywhere.

2. We Can Test It Alone

If path resolution is buried inside each backend, testing is hard:

❌ To test path resolution, you need:
   - A real backend
   - Real files
   - Complex setup

With PathResolver separated:

✅ To test path resolution, you need:
   - Just the resolver
   - Simple inputs and outputs
   - No files required!

3. We Can Benchmark It

“Is our path resolution fast enough?”

If it’s mixed with everything else, you can’t measure it. Separated, you can:

#![allow(unused)]
fn main() {
// Easy to benchmark!
let resolver = IterativeResolver::new();
benchmark(|| resolver.canonicalize("/a/b/../c", &mock_fs));
}

4. We Can Swap It

Different situations need different approaches:

Resolver	Best For
`IterativeResolver`	General use (walks path step by step)
`CachingResolver`	Repeated paths (remembers answers)
`NoOpResolver`	Real filesystem (OS already handles it)

With separation, switching is one line:

#![allow(unused)]
fn main() {
use anyfs::resolvers::{CachingResolver, IterativeResolver};

// Default
let fs = FileStorage::new(backend);

// With caching (for performance)
let fs = FileStorage::with_resolver(
    backend, 
    CachingResolver::new(IterativeResolver::default())
);
}

Think of PathResolver like a GPS system separate from your car:

Component	In AnyFS	In a Car
Storage	Backend (MemoryBackend, SqliteBackend)	The roads themselves
Navigation	PathResolver	GPS device
Interface	FileStorage	Dashboard

Why is GPS a separate device?

✅ You can upgrade the GPS without changing the car
✅ You can test the GPS in a simulator
✅ Different GPS apps can work in the same car
✅ The car maker doesn’t need to be a GPS expert

Same reasons we separated PathResolver!

What PathResolver Actually Does

Input:  /home/user/../admin/./config.txt
                  ↓
         [PathResolver]
                  ↓
Output: /home/admin/config.txt

Step by step:

/home/user → go to user’s home
.. → go back up to /home
admin → go into admin folder
. → stay here (ignore)
config.txt → the file!

Result: /home/admin/config.txt

Symlinks Make It Interesting

Symlinks are like shortcuts. If /home/admin is actually a symlink pointing to /users/administrator, the resolver follows it:

Input:  /home/admin/config.txt
        (but /home/admin → /users/administrator)
                  ↓
         [PathResolver]
                  ↓
Output: /users/administrator/config.txt

The Two Main Methods

`canonicalize()` - Strict Mode

“Give me the real, final path. Everything must exist.”

#![allow(unused)]
fn main() {
resolver.canonicalize("/a/b/../c/file.txt", &fs)
// Returns: /a/c/file.txt (if it exists)
// Error: if any part doesn't exist
}

`soft_canonicalize()` - Relaxed Mode

“Resolve what you can, but the last part doesn’t need to exist yet.”

#![allow(unused)]
fn main() {
resolver.soft_canonicalize("/a/b/../c/new_file.txt", &fs)
// Returns: /a/c/new_file.txt (even if new_file.txt doesn't exist)
// Error: only if /a/c doesn't exist
}

This is useful for creating new files—you need to know WHERE to create them, but they don’t exist yet!

Summary: Why Separate?

Benefit	Explanation
Testable	Test path logic without touching real files
Benchmarkable	Measure performance in isolation
Swappable	Different resolvers for different needs
Maintainable	One place to fix bugs, benefits all backends
Understandable	Each piece has one job

In Code

#![allow(unused)]
fn main() {
// The trait (the "job description")
// Only canonicalize() is required - soft_canonicalize has a default implementation
pub trait PathResolver: Send + Sync {
    fn canonicalize(&self, path: &Path, fs: &dyn Fs) -> Result<PathBuf, FsError>;
    
    // Default: canonicalize parent, append final component
    // Handles edge cases (root path, empty parent)
    fn soft_canonicalize(&self, path: &Path, fs: &dyn Fs) -> Result<PathBuf, FsError> {
        match path.parent() {
            Some(parent) if !parent.as_os_str().is_empty() => {
                let canonical_parent = self.canonicalize(parent, fs)?;
                match path.file_name() {
                    Some(name) => Ok(canonical_parent.join(name)),
                    None => Ok(canonical_parent),
                }
            }
            _ => self.canonicalize(path, fs),  // Root or single component
        }
    }
}

// For symlink-aware resolution (when backend implements FsLink):
pub trait PathResolverWithLinks: PathResolver {
    fn canonicalize_following_links(&self, path: &Path, fs: &dyn FsLink) -> Result<PathBuf, FsError>;
    // soft_canonicalize_following_links also has a default that delegates
}

// FileStorage uses it (boxed for flexibility)
pub struct FileStorage<B> {
    backend: B,                        // Where files live
    resolver: Box<dyn PathResolver>,   // How to simplify paths (boxed: cold path)
}
}

That’s it! PathResolver answers one question: “What’s the real, simple path?”

The soft_canonicalize variant is just a convenience - it reuses canonicalize internally but allows creating new files.

Everything else—reading files, writing files, listing directories—is the backend’s job.

AnyFS Ecosystem Manual