Mercurial repository identification

Tags:

mercurial

I need to uniquely identify a Mercurial repository and store that identifier in a file that is included when the repository is cloned. Putting the identifier in a file in the .hg folder would be preferable to simply adding a normal file to the repo.

I understand that I can get a near-certain identifier from the first changeset that is committed. I know that the hgrc file cannot be used to store the identifier, because it is not cloned.

So, my question is: Is there another file in the .hg folder that is cloned where I can put the identifier? Thanks.

asked Jun 10 '11 13:06

RandyR

1 Answer

On first read, it sounds like you want to be able to make sure that a clone of the repository is a clone of the correct repository and not some stand-in impostor. However, if the identification information you're thinking of using is cloned with everything else, then an impostor would still pass this test. You'd need to keep that identifier separate so that it can be compared against information in the clone.

Whether or not that is your purpose, you probably do not want to edit any file in .hg that is cloned. You'd have to add a tracked file elsewhere in the repo, outside of .hg. However, you don't really need an extra file at all: the changeset hash is not just near certain but very certain, so the information for identifying a repository is built into the repository itself.

On the command line, you can get either the short or the full version of the very first changeset's hash identifier:

> hg id -i -r0
89abf5502e3c

> hg log -r0 --template "{node}"
89abf5502e3c5c65e532db04d8d87141f0ac8b73

If I am correct that you want to compare two identifiers so that you or someone else knows a clone of the repository is a true clone and not a false one, you would publish the same changeset id separately, so that someone can use one of the above commands to see the id of their clone and compare it to what you say it should be. This is much like how many websites with downloadable executable files show a hash next to the download link so that you can hash the file yourself and compare the result to the hash on the website.
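As a rough illustration of that comparison, the Python sketch below shells out to the same hg log command shown above and checks the result against an id obtained from somewhere other than the clone. The function names and the injectable run parameter are invented for this example, and it assumes hg is on the PATH.

```python
import subprocess


def first_changeset_hash(repo_path, run=subprocess.run):
    """Return the full hex hash of revision 0 for the repo at repo_path."""
    # Same command as above; -R points hg at a repository other than the cwd.
    result = run(
        ["hg", "log", "-R", repo_path, "-r", "0", "--template", "{node}"],
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if hg exits with an error
    )
    return result.stdout.strip()


def matches_published_id(repo_path, published_id, run=subprocess.run):
    """Compare the clone's first changeset hash with a separately published id."""
    published_id = published_id.strip().lower()
    if not published_id:
        return False
    # Accept either the 12-character short form or the full 40-character form.
    return first_changeset_hash(repo_path, run=run).startswith(published_id)
```

Calling matches_published_id(".", "89abf5502e3c") in a clone would then say whether its first changeset matches the id you were given.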

Edit regarding your comment that sheds light on the purpose of this:

Since you need to be able to read it from a file, there are a couple of options:

Tracked file in repository root

There is one file you might consider, other than creating your own: .hgtags.

hg tag -r0 ident

...would tag the very first revision, allowing you to use ident as a reference to that changeset rather than -r0. Mercurial always uses tag information from the latest version of .hgtags, no matter what changeset the working directory is updated to, but that may not matter to your app. hg tag appends a line such as this to the .hgtags file, creating the file if it doesn't exist:

a247494248c4b96a571bbd12e90eade3bf559281 ident

This is most handy if you don't have a tags file in your repos yet, because the new entry will be the first line in the file and easy to find. You might think you could simply write this file yourself, but then you'd still have to call hg to get the changeset id, and again at some point to add the file to tracking and commit it. hg tag does all that for you.

If a tags file may already exist, that's OK too, because they tend to be relatively short and you just need to look for the one line that ends with your chosen tag name. Mercurial is designed for append-only operations on .hgtags, but everything would still work fine if you inserted the line for this tag as the very first line of an existing .hgtags, because: 1. The tag will never be moved or removed. 2. You'll be using a tag name not already used in the file.
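Looking for that one line could be done along these lines. This is only a sketch: find_tag_node and read_repo_ident are hypothetical names, the tag name ident comes from the example above, and it assumes each .hgtags line has the form shown earlier (hash, a space, then the tag name).

```python
from pathlib import Path


def find_tag_node(hgtags_text, tag_name):
    """Return the changeset hash recorded for tag_name, or None if absent."""
    node = None
    for line in hgtags_text.splitlines():
        # Each line is "<hex node> <tag name>"; tag names may contain spaces,
        # so split only on the first space.
        parts = line.strip().split(" ", 1)
        if len(parts) == 2 and parts[1] == tag_name:
            node = parts[0]  # keep scanning so the last matching line is used
    return node


def read_repo_ident(repo_root, tag_name="ident"):
    """Read .hgtags from the working directory and look up the tag."""
    return find_tag_node(Path(repo_root, ".hgtags").read_text(), tag_name)
```

Because the tag is never moved, there should only ever be one matching line, so whether the first or the last match is returned should not matter in practice.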

Reading `hg`'s guts

Deeper in .hg there are files that normally only Mercurial itself touches, and they can be read to get the first changeset's hash. I looked into Mercurial's File Formats, Revlog, and RevlogNG documentation, and at least for two of my own repos, .hg\store\00changelog.i contains the first changeset's hash at offset 0x20 (20 bytes long). Probably, at least since Mercurial 0.9, it will be the same in all repos. RevlogNG also notes that the first 4 bytes of that file indicate the revlog version number and flags. While the changeset id is currently only 20 bytes long, the field that holds it is 32 bytes long, probably to allow for a longer hash in the future.

Since this option requires no alteration of existing repositories and only involves reading the first 52-64 bytes of the main index, it's the one I'd probably go with. If I were catching this requirement in the early stages of the product, before any repos it manages were out in the wild, I would lean toward the custom file approach, because I would probably have my own metadata file created and added from the beginning of the repo.
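A minimal sketch of that read, assuming the layout described above (hash at offset 0x20, 20 bytes, version and flags in the first 4 bytes). The function name is made up, and the version check assumes the low 16 bits of the big-endian header hold the format number, with 1 meaning RevlogNG; a repository using a different revlog format would need different handling.

```python
import struct
from pathlib import Path

NODE_OFFSET = 0x20  # offset of the first changeset's hash, per the text above
NODE_LENGTH = 20    # bytes actually used within the 32-byte field


def read_first_node(repo_root):
    """Return the first changeset's hash as hex, read straight from the index."""
    index_path = Path(repo_root) / ".hg" / "store" / "00changelog.i"
    with open(index_path, "rb") as f:
        data = f.read(NODE_OFFSET + NODE_LENGTH)  # first 52 bytes only
    if len(data) < NODE_OFFSET + NODE_LENGTH:
        raise ValueError("index too short; the repository may be empty")
    # Assumption: big-endian header, low 16 bits = format version, rest = flags.
    (version_and_flags,) = struct.unpack(">I", data[:4])
    if version_and_flags & 0xFFFF != 1:
        raise ValueError("not a RevlogNG (version 1) index")
    return data[NODE_OFFSET:NODE_OFFSET + NODE_LENGTH].hex()
```

The result should match what hg log -r0 --template "{node}" prints for the same repository, which is an easy way to sanity-check the offsets before relying on them.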

answered Nov 05 '22 05:11

Joel B Fant
