Can inode and crtime be used as a unique file identifier?

Tags: linux, inode

I have a file indexing database on Linux. Currently I use the file path as an identifier. But if a file is moved or renamed, its path changes, so I cannot match my DB record to the new file and have to delete and recreate the record. Even worse, if a directory is moved or renamed, I have to delete and recreate the records for all files and nested directories.

I would like to use the inode number as a unique file identifier, but an inode number can be reused if a file is deleted and another file is created.

So, I wonder whether I can use a pair of {inode,crtime} as a unique file identifier. I hope to use i_crtime on ext4 and creation_time on NTFS. In my limited testing (with ext4) inode and crtime do, indeed, remain unchanged when renaming or moving files or directories within the same file system.

So, the question is whether there are cases when the inode or crtime of a file may change. For example, can fsck, defragmentation, or partition resizing change the inode or crtime of a file?

Interestingly, http://msdn.microsoft.com/en-us/library/aa363788%28VS.85%29.aspx says:

"In the NTFS file system, a file keeps the same file ID until it is deleted."
but also:
"In some cases, the file ID for a file can change over time."

So, what are those cases they mentioned?

Note that I studied similar questions:

How to determine the uniqueness of a file in linux?
Executing 'mv A B': Will the 'inode' be changed?
Best approach to detecting a move or rename to a file in Linux?

but they do not answer my question.


asked Apr 17 '13 20:04

jhnlmn

2 Answers

{device_nr, inode_nr} is a unique identifier for an inode within a system.
Moving a file to a different directory on the same filesystem does not change its inode_nr.
The Linux inotify interface lets you monitor changes to inodes (either files or directories).
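
The first two points can be checked with a short Python sketch. It assumes the temporary directory sits on a single filesystem, and file_key is a hypothetical helper name, not part of any library:

```python
import os
import tempfile

def file_key(path):
    """Return (device number, inode number) for path, without following symlinks."""
    st = os.lstat(path)
    return (st.st_dev, st.st_ino)

with tempfile.TemporaryDirectory() as root:
    old_path = os.path.join(root, "a.txt")
    with open(old_path, "w") as f:
        f.write("hello")
    key_before = file_key(old_path)

    # Move the file into a subdirectory of the same tree (same filesystem).
    subdir = os.path.join(root, "sub")
    os.mkdir(subdir)
    new_path = os.path.join(subdir, "b.txt")
    os.rename(old_path, new_path)
    key_after = file_key(new_path)

    same_after_rename = key_before == key_after
    print(same_after_rename)
```

This only shows that the pair survives a rename; it says nothing about whether the inode number is reused after the file is deleted.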

Extra notes:

Moving files across filesystems is handled differently (it is in fact copy + delete).
Networked filesystems (or a mounted NTFS) cannot always guarantee the stability of inode numbers.
Microsoft is not a Unix vendor; its documentation does not cover Unix or its filesystems and should be ignored (except for NTFS internals).
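
To make the cross-filesystem case visible to an indexer, one option is to attempt a plain rename first and fall back to copy + delete only when the kernel reports EXDEV. This is a minimal sketch; move_and_report is a hypothetical helper, and the demo below only exercises the same-filesystem branch:

```python
import errno
import os
import shutil
import tempfile

def move_and_report(src, dst):
    """Move src to dst; return "renamed" or "copied" (hypothetical helper)."""
    try:
        os.rename(src, dst)      # same filesystem: same inode, new directory entry
        return "renamed"
    except OSError as exc:
        if exc.errno != errno.EXDEV:
            raise
        shutil.move(src, dst)    # different filesystem: copy, then delete the source
        return "copied"

with tempfile.TemporaryDirectory() as root:
    src = os.path.join(root, "src.txt")
    dst = os.path.join(root, "dst.txt")
    with open(src, "w") as f:
        f.write("x")
    inode_before = os.stat(src).st_ino
    result = move_and_report(src, dst)
    kept_inode = os.stat(dst).st_ino == inode_before
    print(result, kept_inode)
```

When the helper returns "copied", the destination is a new inode on another device, so any record keyed on {device_nr, inode_nr} has to be recreated.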

Extra text: the old Unix adage "everything is a file" should in fact be "everything is an inode". The inode carries all the meta-information about a file (or directory, or special file) except its name. The filename is in fact only a directory entry that happens to link to that particular inode. Moving a file within a filesystem means creating a new link to the same inode and deleting the old directory entry that linked to it. The inode metadata can be obtained with the stat(), fstat(), and lstat() system calls.
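
As an illustration of names being mere directory entries, the sketch below uses Python's wrappers for the three calls (os.stat, os.fstat, os.lstat) and a hard link. It assumes the temporary directory is on a filesystem that supports hard links:

```python
import os
import tempfile

with tempfile.TemporaryDirectory() as root:
    first = os.path.join(root, "first")
    second = os.path.join(root, "second")
    with open(first, "w") as f:
        f.write("data")
        f.flush()
        st_fd = os.fstat(f.fileno())   # fstat(): by open file descriptor

    os.link(first, second)             # second directory entry for the same inode
    st_first = os.stat(first)          # stat(): by path, follows symlinks
    st_second = os.lstat(second)       # lstat(): by path, does not follow symlinks

    same_inode = st_first.st_ino == st_second.st_ino == st_fd.st_ino
    link_count = st_first.st_nlink     # number of directory entries for this inode

    os.unlink(first)                   # remove one name; the inode lives on
    still_there = os.path.exists(second)
    print(same_inode, link_count, still_there)
```

Both names resolve to one inode, and removing one name leaves the file reachable through the other.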


answered Oct 13 '22 22:10

wildplasser

The allocation and management of i-nodes in Unix is dependent upon the filesystem. So, for each filesystem, the answer may vary.

For the Ext3 filesystem (the most popular), i-nodes are reused and thus cannot be used as a unique file identifier, nor does reuse occur according to any predictable pattern.

In Ext3, i-nodes are tracked in a bit vector, each bit representing a single i-node number. When an i-node is freed, its bit is set to zero. When a new i-node is needed, the bit vector is searched for the first zero bit, and the i-node number (which may have been previously allocated to another file) is reused.

This may lead to the naive conclusion that the lowest numbered available i-node will be the one reused. However, the Ext3 file system is complex and highly optimised, so no assumptions should be made about when and how i-node numbers can be reused, even though they clearly will.
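
The naive first-zero-bit model can be sketched as a toy in Python. ToyInodeBitmap is a made-up class for illustration only; it is not how Ext3 actually chooses inodes and ignores block groups entirely:

```python
class ToyInodeBitmap:
    """Toy model: one flag per inode number, False meaning free."""

    def __init__(self, size):
        self.bits = [False] * size

    def allocate(self):
        # Scan for the first free slot, mark it used, return its number.
        for index, used in enumerate(self.bits):
            if not used:
                self.bits[index] = True
                return index + 1   # inode numbers start at 1 in this toy
        raise OSError("no free inodes")

    def free(self, inode_nr):
        self.bits[inode_nr - 1] = False

bitmap = ToyInodeBitmap(8)
a = bitmap.allocate()
b = bitmap.allocate()
c = bitmap.allocate()
bitmap.free(b)          # "delete" the second file
d = bitmap.allocate()   # a new file receives the freed number
print(a, b, c, d)
```

Even in this simplified model, a number handed out for a new file can be identical to one that belonged to a deleted file, which is why the inode number alone cannot identify a file over time.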

From the source code for ialloc.c, where i-nodes are allocated:

There are two policies for allocating an inode. If the new inode is a directory, then a forward search is made for a block group with both free space and a low directory-to-inode ratio; if that fails, then of the groups with above-average free space, that group with the fewest directories already is chosen. For other inodes, search forward from the parent directory's block group to find a free inode.

The source code that manages this for Ext3 is called ialloc and the definitive version is here: https://github.com/torvalds/linux/blob/master/fs/ext3/ialloc.c

answered Oct 13 '22 23:10

Gary Wisniewski
