aboutsummaryrefslogtreecommitdiff
path: root/fs/btrfs/disk-io.c
AgeCommit message (Collapse)AuthorFilesLines
2008-09-25Btrfs: Always use the async submission queue for checksummed writesChris Mason1-7/+0
This avoids IO stalls and poorly ordered IO from inline writers mixing in with the async submission queue Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Enable btree balancing on old kernels againChris Mason1-3/+0
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Change the congestion functions to meter the number of async submits ↵Chris Mason1-0/+9
as well The async submit workqueue was absorbing too many requests, leading to long stalls where the async submitters were stalling. Signed-off-by: Chris Mason <[email protected]>
2008-09-25Fix btrfs_open_devices to deal with changes since the scan ioctlsChris Mason1-2/+2
Devices can change after the scan ioctls are done, and btrfs_open_devices needs to be able to verify them as they are opened and used by the FS. Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Add mount -o degraded to allow mounts to continue with missing devicesChris Mason1-20/+29
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Handle write errors on raid1 and raid10Chris Mason1-5/+51
When duplicate copies exist, writes are allowed to fail to one of those copies. This changeset includes a few changes that allow the FS to continue even when some IOs fail. It also adds verification of the parent generation number for btree blocks. This generation is stored in the pointer to a block, and it ensures that missed writes to are detected. Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Pass down the expected generation number when reading tree blocksChris Mason1-17/+13
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Don't do btree balance_dirty_pages on old kernels, it stalls foreverChris Mason1-0/+8
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Add support for online device removalChris Mason1-55/+46
This required a few structural changes to the code that manages bdev pointers: The VFS super block now gets an anon-bdev instead of a pointer to the lowest bdev. This allows us to avoid swapping the super block bdev pointer around at run time. The code to read in the super block no longer goes through the extent buffer interface. Things got ugly keeping the mapping constant. Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Fixes for 2.6.18 enterprise kernelsChris Mason1-5/+19
2.6.18 seems to get caught in an infinite loop when cancel_rearming_delayed_workqueue is called more than once, so this switches to cancel_delayed_work, which is arguably more correct. Also, balance_dirty_pages can run into problems with 2.6.18 based kernels because it doesn't have the per-bdi dirty limits. This avoids calling balance_dirty_pages on the btree inode unless there is actually something to balance, which is a good optimization in general. Finally there's a compile fix for ordered-data.h Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Deal with failed writes in mirrored configurationsChris Mason1-2/+15
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Drop some verbose printksChris Mason1-13/+5
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Make the resizer work based on shrinking and growing devicesChris Mason1-0/+4
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Add failure handling for read_sys_arrayChris Mason1-2/+9
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Fix the unplug_io_fn to grab a consistent copy of page->mappingChris Mason1-1/+12
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Deal with page == NULL in the btrfs_unplug_io_fnChris Mason1-2/+30
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Make an unplug function that doesn't unplug every spindleChris Mason1-11/+15
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Remove debugging statements from the invalidatepage callsChris Mason1-1/+1
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Scale the bdi ra_pages by the number of devices in the FSChris Mason1-1/+3
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Force page->private removal in btrfs_invalidatepageChris Mason1-0/+12
btrfs_invalidatepage is not allowed to leave pages around on the lru. Any such pages will trigger an oops later on because the VM will see page->private and assume it is a buffer head. This also forces extra flushes of the async work queues before dropping all the pages on the btree inode during unmount. Left over items on the work queues are one possible cause of busy state ranges during truncate_inode_pages. Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Set the btree inode i_size to OFFSET_MAXChris Mason1-7/+26
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Don't drop extent_map cache during releasepage on the btree inodeChris Mason1-9/+14
The btree inode should only have a single extent_map in the cache, it doesn't make sense to ever drop it. Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Only do async bio submission for pdflushChris Mason1-0/+7
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Create a work queue for bio writesChris Mason1-3/+90
This allows checksumming to happen in parallel among many cpus, and keeps us from bogging down pdflush with the checksumming code. Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Add chunk uuids and update multi-device back referencesChris Mason1-1/+5
Block headers now store the chunk tree uuid Chunk items records the device uuid for each stripes Device extent items record better back refs to the chunk tree Block groups record better back refs to the chunk tree The chunk tree format has also changed. The objectid of BTRFS_CHUNK_ITEM_KEY used to be the logical offset of the chunk. Now it is a chunk tree id, with the logical offset being stored in the offset field of the key. This allows a single chunk tree to record multiple logical address spaces, upping the number of bytes indexed by a chunk tree from 2^64 to 2^128. Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: A few updates for 2.6.18 and versions older than 2.6.25Chris Mason1-2/+10
This includes fixing a missing spinlock init call that caused oops on mount for most kernels other than 2.6.25. Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: bio_endio support for linux 2.6.23 and older.Miguel1-1/+4
bio_endio() changed prototype on linux 2.6.24, support older kernels using the older prototype. Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Endianess bug fix for v0.13 with kernelsMiguel1-2/+2
Fix for a endianess BUG when using btrfs v0.13 with kernels older than 2.6.23 Problem: Has of v0.13, btrfs-progs is using crc32c.c equivalent to the one found on linux-2.6.23/lib/libcrc32c.c Since crc32c_le() changed in linux-2.6.23, when running btrfs v0.13 with older kernels we have a missmatch between the versions of crc32c_le() from btrfs-progs and libcrc32c in the kernel. This missmatch causes a bug when using btrfs on big endian machines. Solution: btrfs_crc32c() macro that when compiling for kernels older than 2.6.23, does endianess conversion to parameters and return value of crc32c(). This endianess conversion nullifies the differences in implementation of crc32c_le(). If kernel 2.6.23 or better, it calls crc32c(). Signed-off-by: Miguel Sousa Filipe <[email protected]> --- Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Add extra checks to avoid removing extent_state from pages we can't freeChris Mason1-0/+6
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Write out all super blocks on commit, and bring back proper barrier ↵Chris Mason1-5/+113
support Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Retry metadata reads in the face of checksum failuresChris Mason1-19/+53
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Handle data block end_io through the async work queueChris Mason1-11/+23
Before it was done by the bio end_io routine, the work queue code is able to scale much better with faster IO subsystems. Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Do metadata checksums for reads via a workqueueChris Mason1-34/+224
Before, metadata checksumming was done by the callers of read_tree_block, which would set EXTENT_CSUM bits in the extent tree to show that a given range of pages was already checksummed and didn't need to be verified again. But, those bits could go away via try_to_releasepage, and the end result was bogus checksum failures on pages that never left the cache. The new code validates checksums when the page is read. It is a little tricky because metadata blocks can span pages and a single read may end up going via multiple bios. Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Add additional debugging for metadata checksum failuresChris Mason1-1/+2
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Fix allocation profile initChris Mason1-7/+4
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Add support for duplicate blocks on a single spindleChris Mason1-3/+5
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Add support for mirroring across drivesChris Mason1-13/+7
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Verify checksums on tree blocks found without read_tree_blockChris Mason1-36/+50
Checksums were only verified by btrfs_read_tree_block, which meant the functions to probe the page cache for blocks were not validating checksums. Normally this is fine because the buffers will only be in cache if they have already been validated. But, there is a window while the buffer is being read from disk where it could be up to date in the cache but not yet verified. This patch makes sure all buffers go through checksum verification before they are used. This is safer, and it prevents modification of buffers before they go through the csum code. Signed-off-by: Chris Mason <[email protected]>
2008-09-25Fix btrfs_fill_super to return -EINVAL when no FS foundYan1-2/+1
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Reorder the flags field in struct btrfs_header and record a flag on writeoutChris Mason1-4/+9
This allows detection of blocks that have already been written in the running transaction so they can be recowed instead of modified again. It is step one in trusting the transid field of the block pointers. Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Add leak debugging for extent_buffer and extent_stateChris Mason1-0/+2
This also fixes one leak around the super block when failing to mount the FS. Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Use a higher default ra pagesChris Mason1-1/+1
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Create a btrfs backing dev infoChris Mason1-0/+53
This allows intelligent versions of unplug and congestion functions Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Implement raid0 when multiple devices are presentChris Mason1-0/+2
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Add support for device scanning and detection ioctlsChris Mason1-9/+15
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Bring back mount -o ssd optimizationsChris Mason1-0/+2
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Move device information into the super block so it can be scannedChris Mason1-1/+3
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Dynamic chunk and block group allocationChris Mason1-1/+1
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Add support for multiple devices per filesystemChris Mason1-6/+91
Signed-off-by: Chris Mason <[email protected]>
2008-09-25Btrfs: Allocator improvementsChris Mason1-1/+1
Reduce CPU time searching for free blocks by optimizing find_first_extent_bit Fix find_free_extent to make better use of the last_alloc hint. Before it was often finding blocks just before the hint. Signed-off-by: Chris Mason <[email protected]>