Did recent changes possibly slow down builds?
Over in the rust-lang/rust GitHub repository our CI has recently (over the past week or so) started timing out on AppVeyor far more than it used to. Our builds run with a maximum of 3 hours and typically come in around 2 hours, so they're somewhat sensitive to timing, unfortunately.
Over the past week or so we've been seeing an elevated number of our builds timing out. Analyzing the build logs and comparing them to successful runs that didn't time out tends to show blanket slowdowns across the board, where basically all the granular steps of the build take longer than they previously did (no single step stands out as the cause). To work around this on our end we typically retry the build (re-enqueue it), and the second time the build runs (with effectively the same code) it often goes much faster, completing in the allotted time.
In investigating this we wanted to reach out to see if y'all know whether something may be awry. Were there any underlying changes in the past week or two which might affect this? If not, we'll keep digging!
An example we have of this is here:
- Failed build: https://ci.appveyor.com/project/rust-lang/rust/builds/20187550
- Succeeded build: https://ci.appveyor.com/project/rust-lang/rust/builds/20191704
The job which previously took 3 hours afterwards took under two hours. Although the commit hashes were different, the contents of the two commits that were tested should have been exactly the same.
1 Posted by Ilya Finkelshte... on 12 Nov, 2018 06:41 PM
Hi Alex,
At the end of last week we did indeed find a node with degraded performance and replaced it over the weekend. Now things should be back to normal, but I see that this build and this build are still failing, and this happens on presumably healthy nodes.
We do not see a performance degradation on our side, but we will investigate more.
Meanwhile we have increased your build timeout to 4 hours. If you do not like that and want to fail fast, you can decrease it in the General tab of the project settings under "Build timeout, minutes". Also, a new button called RE-RUN INCOMPLETE is coming soon (hopefully this week or so) which will allow you to re-run only the failed and cancelled jobs in a matrix.
Also, a build worker image update happened on Friday evening. Do you think that any of those changes could affect the performance?
And finally, I would recommend trying to run the jobs which fail often on the Visual Studio 2017 image and letting us know if they behave differently. You can set the image per job with the APPVEYOR_BUILD_WORKER_IMAGE environment variable as described here; you are already selecting Visual Studio 2017 Preview this way now.
Please keep in touch and let us know what you find.
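A minimal sketch of how selecting the image per job could look in appveyor.yml (the matrix entries and the RUST_CHECK_TARGET variable below are illustrative, not taken from the actual rust-lang/rust configuration):

```yaml
# Hypothetical appveyor.yml fragment: pin individual matrix jobs to a
# specific worker image via the APPVEYOR_BUILD_WORKER_IMAGE variable.
environment:
  matrix:
    # This job runs on whatever default image the project is configured with.
    - RUST_CHECK_TARGET: check

    # This job is pinned to the Visual Studio 2017 image instead.
    - RUST_CHECK_TARGET: dist
      APPVEYOR_BUILD_WORKER_IMAGE: Visual Studio 2017
```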
We, from our side, are looking deeply into performance issues now and plan some major datacenter upgrades and migrations in the near future.
Ilya.
2 Posted by acrichton on 12 Nov, 2018 09:52 PM
Ok thanks for all the information! The increased timeout will hopefully help for now (thanks!) and we'll keep our eyes peeled on our end.
3 Posted by acrichton on 13 Nov, 2018 07:14 PM
Do y'all perhaps have statistics on whether the VMs that we're running on are shared with other possibly high-CPU workloads? (or maybe even our own workloads?)
Comparing this 1h36m build with this 3h17m build, the build got nearly 2x slower with very similar code being tested. Our own analysis shows that building the Rust compiler, a very CPU-intensive workload, was nearly 40% slower in the latter build than in the former. (Compiling the Rust compiler does a little I/O but is almost always bound by CPU/memory.)
A still-running build is also executing over an hour slower than the previous build :(. If y'all have any data you can share about the hosting environment, and whether we're perhaps on noisy machines, that would help explain this and would be much appreciated!
4 Posted by acrichton on 13 Nov, 2018 07:15 PM
Er, sorry, I meant to mention this earlier, but the build image update doesn't seem like it'd affect us much; it's mostly compiler toolchain revisions and/or runtime updates which tend to affect our builds the most.
5 Posted by Ilya Finkelshte... on 13 Nov, 2018 09:31 PM
We are experiencing very high load lately and this affects I/O (CPU is OK). This obviously should not affect you. We are doing the following things at the moment:
Short-term solution -- we are decreasing build density at the moment, so high load and noisy neighbors will not affect you (or at least will affect you much less) at peak hours. Note however that a side effect of this is that some builds at peak hours will run on Google cloud. Performance there is good, but the build start time is 3-4 minutes (the time to provision a VM), which is negligible given your build times.
Long term -- we are working on adding a new datacenter for our Hyper-V infrastructure. I cannot give an exact ETA, but it should be added in a couple of weeks.
Also, we propose that you forcibly run some specific, very heavy jobs on Google cloud. For that, please set the environment variable appveyor_build_worker_cloud to gce for those jobs in the matrix.
Please let us know how it goes.
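A minimal sketch of how forcing a heavy matrix job onto Google cloud could look in appveyor.yml (again, the job variables here are illustrative and not taken from the actual rust-lang/rust configuration):

```yaml
# Hypothetical appveyor.yml fragment: force one heavy matrix job onto GCE
# by setting appveyor_build_worker_cloud for that entry only.
environment:
  matrix:
    # Heavy job: provisioned on Google cloud (slower VM start, steadier CPU).
    - RUST_CHECK_TARGET: dist
      appveyor_build_worker_cloud: gce

    # Lighter job: stays on the default Hyper-V cloud.
    - RUST_CHECK_TARGET: check
```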
6 Posted by acrichton on 13 Nov, 2018 09:44 PM
Oh, that sounds perfect, thanks for the information! I've sent a PR to switch to GCE, and I also sent a PR to switch to the VS2017 Preview images. It'll probably take a while for those to land and for us to get a feel for whether we see any more timeouts, but we'll get back to you if anything shows up!
7 Posted by acrichton on 19 Nov, 2018 04:36 PM
This appears to have basically solved our issue, thanks so much again for the tip!
8 Posted by Ilya Finkelshte... on 21 Nov, 2018 01:53 AM
Just noticed that your current build is running in our main datacenter and the appveyor.yml for the auto branch does not have this setting. Trying to track down related changes in your repo... If you have an idea how this happened, please let me know.
9 Posted by acrichton on 21 Nov, 2018 03:36 AM
Oh, no worries! That's to be expected. All our CI happens on the auto branch regardless of what the destination branch is, so our master branch has the fix (scheduled on GCE) but our beta branch doesn't have the fix yet (it's not forcibly scheduled on GCE). That build you saw was from the beta branch, so it's just configuration on our end!
acrichton closed this discussion on 15 Jan, 2019 07:49 PM.