AppVeyor: Discussion

From git, perform shallow clone

2014-05-21T03:20:33Z

Hi Chris,

This is how it was before, but after few clients reported issues with depth=50 it was reverted back to a full clone. 50 was not enough for some cases like large rebases. Rather than playing with the depth (which is clearly a bet) we are thinking on two possible solutions to this problem:

1) Using GitHub API calculate required depth (distance) from required commit to the last one right before calling clone. It's definitely better than a fixed number and in the most cases the depth will be 1. However, there is still a probability of another push between querying for depth and cloning the repo.

2) Use GitHub API to download required commit (ref) as a single zip. We did some proof-of-concepts and it works really well. This method is my favorite :) through if you are relying on .git folder in your builds it won't work for you.

From git, perform shallow clone

2014-05-30T21:34:41Z

Hey Chris,

We've just deployed a new feature called "shallow clone" which uses GitHub API to grab specific commit's zipball. It's experimental, but you can give it a try by putting in your appveyor.yml:

shallow_clone: true

Let me know if it makes your build lighter.

From git, perform shallow clone

2014-05-31T18:16:47Z

Hey Feodor:

Thanks for implementing that feature. However, building my project is taking over 30 minutes, so I am unable to use the service (unless, of course, I decide to pay). It is a fantastic service, though! I hope I'll be using it in the future. It's evident you're rapidly addressing your users' concerns.

From git, perform shallow clone

2014-05-31T19:19:24Z

Ah, OK :) I'm wondering what's taking so long in your build?

From git, perform shallow clone

2014-05-31T22:13:35Z

It's just a big C++ project.

From git, perform shallow clone

2014-05-31T22:16:33Z

I see. How long does build take on your current CI server and what's its configuration?

From git, perform shallow clone

2014-06-02T14:34:48Z

I need shallow clone too, but I'm using the web UI instead of .yml config. Is there a way I can use this feature? Without it, my clones take 10+ minutes and I cannot use the AppVeyor service. :(

From git, perform shallow clone

2014-06-02T16:21:10Z

Hi David,

Have you tried running shallow_clone: true through appveyor.yml? I'm wondering how long would it take for your project to download it. It's experimental feature and we'd like to collect some feedback before making it on UI.

From git, perform shallow clone

2014-06-02T16:41:04Z

I haven't. In order to do that, I'd have to entirely switch to .yml
configuration, right? I can't do that because we have different configs
depending on branch, and I'm going to need to set up multiple build configs
in the UI.

From git, perform shallow clone

2014-06-02T18:35:56Z

Yes, you should switch to appveyor.yml.

But you can have different appveyor.yml for every branch, no? That's the beauty of this approach - build config is stored along with your sources and it's versioned! When I do a new branch I inherit appveyor.yml from master and then just update appveyor.yml to make it work with a new branch. When AppVeyor starts a new build it downloads branch-specific YAML config.

From git, perform shallow clone

2014-06-02T18:47:53Z

We have a Production repository and a development repo with a master branch for releasable code, a develop branch for code that should end up on a testing server, and various feature branches that aren't deployable.

As features are tested, they're merged to develop to end up on testing. When completed, they're pull requested into master which deploys to our UAT instance. When master is ready, it gets pushed to the protected production repository. Each of those workflow operations would overwrite or merge failure on the dissimilar yml file and thus require manual conflict resolution. Then, what happens if I accidentially merge incorrectly, and now just pushed out UAT system settings to production?

That sort of fragility is what I avoid with settings stored in the UI instead. We're currently on TeamCity, where all of our settings are stored in the UI, but hoping to move over to AppVeyor. However, I can't store CI settings in the main repository for these reasons.

From git, perform shallow clone

2014-06-02T18:59:38Z

Ah, I see. Thanks for describing your scenario. Indeed, merging into master with appveyor.yml changes might do a mess...

OK, we will add "Shallow clone" checkbox on UI - hopefully will push it in today's update.

From git, perform shallow clone

2014-06-02T19:02:11Z

If downloading through zip still takes too long we'll add configurable depth parameter for git to see if it helps.

From git, perform shallow clone

2014-06-02T19:12:38Z

We do not currently have a CI server for Windows. On Linux (travis-ci), the
build takes no more than 10 minutes.

From git, perform shallow clone

2014-06-02T19:21:26Z

So you have dual-platform project and trying to establish automatic builds on Windows platform too? I guess it's a private project on Travis, right?

From git, perform shallow clone

2014-06-02T19:28:17Z

Correct. Well, Windows, Mac, Linux. It's public:
https://travis-ci.org/simbody/simbody.

From git, perform shallow clone

2014-06-02T19:39:19Z

Well, I see there would be other things besides cloning the repo like make tools, compiler (we have VC++ right now), etc. Do you have any plan for making it built on Windows? Maybe we could deploy a separate build worker image to play with such projects...

From git, perform shallow clone

2014-06-02T19:45:14Z

Oh that is not an issue for me. I think I can do everything I need to do,
except that it takes longer than 30 mins. See my appveyor script:
https://github.com/chrisdembia/simbody/blob/patch-3/appveyor.yml

From git, perform shallow clone

2014-06-02T19:48:33Z

I see. Currently builds run on "Small" Azure instances with one CPU core. Wondering how long would it take to run it on "Medium" instance with 2 cores...

From git, perform shallow clone

2014-06-03T02:25:52Z

Under which settings screen should I look for that checkbox?

From git, perform shallow clone

2014-06-03T04:00:48Z

What checkbox?

From git, perform shallow clone

2014-06-03T04:52:38Z

David,

Just wanted to let you know that AppVeyor update with shallow clone/depth on UI has been deployed. You can see these settings on "General" tab of project settings.

Let me know how it goes.

From git, perform shallow clone

2014-06-03T11:48:26Z

Definitely a HUGE improvement. However, it took about a minute and a half
to download the 70 MB commit snapshot. When I download it locally, GitHub
downloads at 6.5 MB/s (the download takes just over 10 seconds). If you're
using Azure small machines, you should have 100 Mbit/s available, so you
should be able to get the 6.5 MB/s. Any idea what's wrong?

From git, perform shallow clone

2014-06-03T16:47:26Z

That's interesting. Right, I think downloading zip is not an issue. My guesses would be a) packaging commit on GitHub side or b) unzipping archive on AppVeyor side. The bottleneck may be either CPU or I/O.

How many files are there in repo?

From git, perform shallow clone

2014-06-03T17:21:00Z

You're completely right -- its almost definitely the unzip, which I hadn't
tried locally, because of the huge volume of files. Would a git shallow
clone be faster than unzipping?

From git, perform shallow clone

2014-06-03T17:24:34Z

Yes, another option is trying out "Clone depth" which adds --depth parameter to git clone command.

From git, perform shallow clone

2014-06-03T17:28:33Z

But I'm wondering if it's I/O problem or CPU. I've been noticing that VMs sometimes are not very cool in doing disk ops.

Just for curiosity, what if you set clone folder (General tab) to some location at d: drive, let say d:\projects\test. D: drive is "temp" storage on Azure VMs, it's transient and it is local hypervisor hard drive, not NAS. Would it be faster or slower? :)

From git, perform shallow clone

2014-06-03T19:16:11Z

Azure VMs get a hard limit of 500 IOPS per disk on all VM sizes from XS
to XL. Since you're using small VMs, you could get more IO by creating a
RAID0 of two data disks. I do this on my Azure Extra Large VMs for database
servers where I RAID0 8 disks. Theres no data loss potential for RAID0
because the disks are stored in locally redundant storage.

However, for checkouts, you are right that the temp disk is probably a
better bet. I'll give it a shot an report back.

From git, perform shallow clone

2014-06-03T19:24:00Z

No slower, but no faster.

Clone depth = 50 took 3x longer than the ZIP.

From git, perform shallow clone

2014-06-03T19:34:38Z

OK, got it. I like the idea with RAID. Will play with it some day.