TestBot News

Rémi Bernon rbernon at codeweavers.com
Wed May 4 12:54:50 CDT 2022


Hi François!

On 5/4/22 16:11, Francois Gouget wrote:
> On Thu, 28 Apr 2022, Zhiyi Zhang wrote:
> [...]
>> Could you make them go faster? Maybe balancing the load a bit or
>> adding more hardware?
> 
> The issue is that WineTest takes time and new jobs have to wait for
> running tasks to complete to get their turn. But then they have priority
> over WineTest.
> 
> I collected some data about the WineTest tasks (see attached
> spreadsheet) and they take between 25 minutes on Windows and 35 minutes
> on Linux. The main issue here is VMs that have many test configurations
> which must therefore be run sequentially. The three VMs with the longest
> chains are:
> 
>    Time  Configs VM
>    6.7 h    11   debian11
>    6.7 h    16   w1064
>    6.8 h    15   w10pro64
> 
> What this means is that no amount of rebalancing can get the tests to
> run in less than about 7 hours.
> 
> And here are the results at the VM host level:
> 
>    Time  Configs Host
>    7.2 h    12   vm1
>    1.4 h     3   vm2
>    7.7 h    18   vm3
>   12.1 h    25   vm4
> 
> The issue is vm2 is too slow and old to run most VMs nowadays. So moving
> some test configurations from vm4 to vm1 or vm3 will push those to 9 /
> 10 hours. So I'll restart the process of getting new hardware to replace
> vm2.
> 
> The other options:
> * Fix the tests that get stuck: they waste 2 minutes each.
>    But it looks like there are only two of those left, conhost.exe:tty and
>    wscript.exe:run, so there's not much to gain.
> 
> * Speed up the slow tests, potentially by using multi-threading. What
>    sucks is we have no way of tracking which tests are slow, which test
>    configurations are slow, etc. It would be nice to have something like
>    the patterns page but for runtime (and also for the tests output
>    size).
> 
> * Getting hardware with faster single thread performance: over 90% of
>    the tests are single-threaded. vm2 is meant to be the first step
>    towards this.
> 
> * Splitting the VMs with many test configurations so the test load can
>    be spread across multiple hosts. That is, instead of having a single
>    VM with 15 test configurations that must run sequentially like
>    w10pro64, have two VMs with 7 and 8 configurations each that can run
>    in parallel. But that makes an extra VM to manage and requires having
>    hosts to spread them to :-(
> 
> * Load balancing could help, assuming the TestBot is smart enough.
> 
>    That is, if it starts by running the debiant and w7u tasks on vm4,
>    then by the time the other hosts are idle all that's left to run is
>    w10pro64's 15 test configurations that must be run sequentially
>    anyway. So the scheduler must give priority to the VMs with the
>    highest count of pending tasks.
> 
>    Load balancing could help reduce the latency by ensuring the builds
>    are done earlier. Here's a worst case scenario right now:
>      t=0  vm2 starts a WineTest job
>      t=1  Developer submits a job. First comes the build step
>      t=25 vm2 completes the WineTest job
>      t=25 vm1, vm3 and vm4 each start a new WineTest job
>      t=26 vm2 completes the developer's build task
>      t=50 vm1, vm3 and vm4 complete their WineTest task
>      t=51 vm1, vm3 and vm4 start the developer's Windows tasks
>    Having multiple build VMs would make it more likely that the blocking
>    build step is completed before any other WineTest task.
>    This is also why it's good that vm2 is not too busy.
> 
> * Reducing the number of test configurations :-(
> 
> 
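
The "give priority to the VMs with the highest count of pending tasks"
heuristic could be sketched roughly like this (a minimal illustration
on my part, not the actual TestBot scheduler; the VM names and task
counts are made up):

```python
# Sketch of the scheduling heuristic described above: when a host slot
# becomes free, start the VM with the most pending tasks, since its
# tasks must run sequentially and therefore bound the total latency.
# NOT the actual TestBot scheduler; names and counts are illustrative.

def pick_next_vm(pending):
    """pending maps VM name -> number of tasks still queued for it."""
    candidates = {vm: n for vm, n in pending.items() if n > 0}
    if not candidates:
        return None
    # Longest chain first: it cannot be parallelized, so start it early.
    return max(candidates, key=candidates.get)

# w10pro64's 15 sequential configurations dominate, so it goes first.
pending = {"w10pro64": 15, "debiant": 3, "w7u": 2}
print(pick_next_vm(pending))
```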


Thanks indeed for the TestBot, it's been really useful, so much so that 
I'm sometimes a bit worried about over-using it.


There's one thing that I've been wondering about for a while, and that 
I believe has an impact on the load:

When a patch is submitted that is detected as potentially touching more 
than a single test, all the tests for the module are queued for testing. 
However, this isn't done through WineTest; instead they are all queued 
and tested separately, at least on the Windows VMs.

Wouldn't it be better to always run the tests through WineTest, and make 
it run all the tests that need checking at once?

For instance https://testbot.winehq.org/JobDetails.pl?Key=113995, which 
had to run all the dinput tests, apparently took ~1h to complete (if I 
can trust the elapsed time compared to the job from the patch before 
it, as I believe it is somehow cumulative). The total time for the 
individual tests is more like ~25min, so I think the VM cleanup time 
adds up a lot.

I also think that it would make it easier to reproduce some of the 
failures that the nightly runs suffer from, which are related to tests 
interacting badly with each other.


Another question, unrelated to the performance problems: could we 
consider adding more Desktop/WM environments to the Debian VMs? I think 
it could be interesting to have in order to track down winex11 bugs, 
though it's likely to result in several broken tests.

I also intend, at some point when the win32u conversion is more 
settled, to finish sending my nulldrv patches, and I think it'd be nice 
to have a TestBot flavor that could be configured to use it instead of 
the default graphics driver.

I believe it mostly works already, except for the user32:monitor and 
user32:sysparams tests, but it needs some registry changes to use it. 
It could be interesting to have in order to make the user32/win32u 
tests more reliable.
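
For reference, the graphics driver is selected via the "Graphics" value 
under HKCU\Software\Wine\Drivers, so the setup step inside the test VM 
might look like this (a sketch only; that the nulldrv patches register 
under the name "null" is an assumption on my part):

```shell
# Sketch: switch Wine to a null graphics driver via the registry.
# The "Graphics" value under HKCU\Software\Wine\Drivers selects the
# driver; "null" as the driver name is assumed here, not confirmed.
wine reg add 'HKCU\Software\Wine\Drivers' /v Graphics /d null /f
```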


Cheers,
-- 
Rémi Bernon <rbernon at codeweavers.com>


