Vtuber Superchat Data Analysis - Hololive Charts

If you want to follow along, the associated files are hosted here.

The main data science process can be found here.

Skip to here for charts. The following part is the code to set up the charts.

Data Management + Representation

Well, we now have all of the data we need to conduct our studies on, but a lot of it is not yet ready to work with. For example, the video start and end times are given in ISO8601 format strings (with a T between the date and the time, and a Z indicating that the time zone is UTC+0. We can't immediately work with these strings and need to change them to datetime objects first. We also need to do the same with the superchat timestamps, which are given as timestamps in microseconds. We need to also change those to datetimes.

Since this part is run separately from the previous ones (so that we don't need to do that gruesome data collection again), we start by opening the file we saved earlier.

CHARTS

Correlation between time of day and superchats

All times are in UTC. Add 9 hours to get JST.

HOLOLIVE JP CHARTS

HOLOLIVE EN CHARTS

HOLOLIVE ID CHARTS

Time of Day and Superchats: Summary

Although we did not do any quantitative analyses here nor find anything groundbreaking, we did corroborate some concepts that might have seemed like common sense.

Possible Future Work

While we qualitatively asserted that the shape of the "superchats per hour" distribution was similar but different to the "streams live at any time" distribution, we did not back this up with numbers. In future work, one could use the 2-sample Kolmogorov-Smirnov test (another link for more info) to test if the two datasets have the same shape.

Already, we can see the r-value is higher. This is good, as it indicates it matches up with our data better. Let's plot it:

We can also plot the residuals, to see if our regression is good.

We can split our residuals into 4 equally-wide groups (based on the view count) and make a violin plot to check for normality.

Anyways, from our basic regression, it seems like superchat earnings can be roughly modelled by the equation:

$y = 0.0663126 x^{0.758142}$

Where x is the number of viewers and y is the superchat revenue.

Video game streams - better or worse?

Next, let's see if video game streams do better or worse than this average.

Interesting. The r-value is higher (giving us a $r^2 \approx 0.4$), implying that this regression is a better fit for this subset of data than our original regression line was for the entire dataset, but that's probably to be expected. Also, this regression line has a greater slope but lower intercept than the original regression. A possible hypothesis to explain this is that in general people do not superchat as much on game streams, as evidenced by the fact that most of the data points have fewer views than the intersection of the two regression lines. However, for popular games that a lot of people enjoy (and thus also watch), they are more willing to shell out money. Of course, checking whether this is the case or not is out of scope of this project.

Of course, the residuals exhibit similar behavior as before, and there no reason to believe that this is a bad fit.

Next, let's take a closer look at the most popular games of the past few weeks.

Seems like there are 6 games that were streamed over 25 times among all Vtubers in this study. We can plot these all to see if anything interesting is apparent immediately.

No trends jump out yet. Let's take a closer look at each one, and add regression lines. (Residuals have been omitted for brevity, but if we really wanted to be rigorous we could add those).

While it does seem like some of these games have significantly different slopes (in particular, the Among Us) streams, it is important to note that there were only 24 points used for this stream's regression, so it's highly likely that the regression for this is inaccurate. Overall, we're starting to get too few data points in each category to be reliable, so we should probably end the analysis here for views vs. superchat earnings, and conclude that individual game titles make little difference on superchat earnings.

Note: I talked with a Vtuber fan friend of mine who said that Among Us streams tend to not get many superchats. This seems to be corroborated by the data.

Views vs Superchats Findings, summarized

Takeaways

While we weren't able to make a one-size-fits-all formula to model the amount of money a Vtuber will make from superchats, we gained a lot of insight into the nature of superchat earnings.

Hopefully you found this interesting and learned something new!

Future Possibilities

Vtubers list

This is a list of all the Vtubers I used in my dataset, with links to their channels. It is sorted in descending order of subscribers.Sorry if your favorite isn't here!

Name Affiliation
A.I.Channel independent
Gawr Gura Ch. hololive-EN hololive_en
Korone Ch. 戌神ころね hololive
Pekora Ch. 兎田ぺこら hololive
フブキCh。白上フブキ hololive
Mori Calliope Ch. hololive-EN hololive_en
Marine Ch. 宝鐘マリン hololive
Aqua Ch. 湊あくあ hololive
Watson Amelia Ch. hololive-EN hololive_en
Rushia Ch. 潤羽るしあ hololive
HAACHAMA Ch. 赤井はあと hololive
Coco Ch. 桐生ココ hololive
Noel Ch. 白銀ノエル hololive
Okayu Ch. 猫又おかゆ hololive
Matsuri Channel 夏色まつり hololive
Takanashi Kiara Ch. hololive-EN hololive_en
Ninomae Ina'nis Ch. hololive-EN hololive_en
Suisei Channel hololive
Subaru Ch. 大空スバル hololive
Watame Ch. 角巻わため hololive
Kanata Ch. 天音かなた hololive
Botan Ch.獅白ぼたん hololive
SoraCh. ときのそらチャンネル hololive
月ノ美兎 nijisanji
Mio Channel 大神ミオ hololive
Moona Hoshinova hololive-ID hololive_id
本間ひまわり - Himawari Honma - nijisanji
犬山たまき / 佃煮のりおチャンネル independent
Towa Ch. 常闇トワ hololive
Nene Ch.桃鈴ねね hololive
Kureiji Ollie Ch. hololive-ID hololive_id
鈴原るる【にじさんじ所属】 nijisanji
リゼ・ヘルエスタ -Lize Helesta- nijisanji
アルス・アルマル -ars almal- 【にじさんじ】 nijisanji
戌亥とこ -Inui Toko- nijisanji
Ayunda Risu Ch. hololive-ID hololive_id
竜胆 尊 / Rindou Mikoto nijisanji
天野ピカミィ. Pikamee voms
Pavolia Reine Ch. hololive-ID hololive_id
Airani Iofifteen Channel hololive-ID hololive_id
鷹宮リオン / Rion Takamiya nijisanji
夢月ロア🌖Yuzuki Roa nijisanji
Anya Melfissa Ch. hololive-ID hololive_id
魔界ノりりむ nijisanji
ぽちまる:POCHI-GOYA channel independent
Roberu Ch. 夕刻ロベル holostars
緋笠トモシカ - Tomoshika Hikasa - voms
Shien Ch.影山シエン holostars
ピーナッツくん!オシャレになりたい! independent
Aruran Ch. アルランディス holostars