Showing posts with label fanfic. Show all posts

Tuesday, 11 January 2011

FanFiction.Net Fandoms: Story and Traffic Statistics

This analysis of FanFiction.Net statistics covers all fandoms. The amount of data collected on January 1, 2011 is enormous, and we can now compare how each fandom registered on FanFiction.Net has grown since our first release.

We start with the basics and site-wide descriptive statistics before entering top-level categories (Anime, Books, Games et cetera) and delving into individual fandoms. This research paper involves not only the biggest fandoms, but also more obscure, yet dynamic communities. Comparative charts and future growth predictions are presented to illustrate the trends. Off-site sources aid the study in audience profiling and global fandom trends. Due to the volume of analysis, it is presented in several consecutive posts to save scrolling space and browsing resources.

The goal of this release is to give you a health check of every category and to find the most resilient top-tier category of 2010.

Warning: Reading this text may take some time, so you can do it in parts. Everyone can post a comment. FFN Research has a new release in store, but it’s always nice to know what questions interest individual readers. The text was written to be simple enough for ages 13 and up, but if something confuses you or you find an error, please let us know.

BASIC INFORMATION

FanFiction.Net has 5879 fandoms (series/categories). These fandoms contain 3,744,842 stories.

The site houses 621 more fandoms than on July 15, 2010, and 1368 more than at the end of 2009. Fandoms created in 2010 make up 23% of all fandoms you see now, which is a 30% increase in total fandoms over the previous year.

An average fandom on FFN has 637 stories. A median fandom has 14 stories (69 fandoms have 14 stories), which is two stories fewer than six months ago. The mode fandom has 1 story (793 fandoms have only 1 story).
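The average and the two percentages above follow directly from the quoted counts. Here is a minimal Python sketch of that arithmetic; the variable names are ours, and only the three figures stated in this post are used.

```python
# Figures quoted in this post (January 1, 2011 snapshot).
total_fandoms = 5879
total_stories = 3_744_842
new_fandoms_2010 = 1368

# Fandoms that already existed at the end of 2009.
fandoms_end_2009 = total_fandoms - new_fandoms_2010

# Mean stories per fandom, share of fandoms created in 2010,
# and growth of the fandom count relative to the end of 2009.
mean_stories = total_stories / total_fandoms
share_created_2010 = 100 * new_fandoms_2010 / total_fandoms
growth_2010 = 100 * new_fandoms_2010 / fandoms_end_2009

print(f"mean stories per fandom: {mean_stories:.0f}")
print(f"share of fandoms created in 2010: {share_created_2010:.0f}%")
print(f"growth in fandom count during 2010: {growth_2010:.0f}%")
```

The median and mode cannot be checked this way because they need the full per-fandom story counts, which are not listed here.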

Below, you see how top categories fared in 2010, ranked by size. The biggest winners and losers are highlighted.


Anime/Manga was the largest contributor in 2010, responsible for 27.6% of growth site-wide. The category gained almost 18% in new fanworks this year. Cartoons, along with TV, had an in-category increase of more than 25%. Plays shrank by 12.7%, but because of their minuscule weight on FFN overall, this amounted to only -0.4% of the annual growth. Nonetheless, it is strange to see a whole top-level category lose weight throughout the year. No media categories shifted in rank since the beginning of 2010.

In total, FFN grew by 20% in 2010 and received over 627,000 new story uploads. The site’s account total rose by a similar value.

NEW FANDOMS

Numerous fandoms arrived to the site in 2010, affecting the top media category structure. Below, you see a table with the total number of fandoms in every category on two dates and two story count meters. These are explained as follows along with other columns that may raise questions:

One – the number of fandoms in a category that have exactly one story. It is included to point out how many series can be considered a failed venture that did not generate any attention.

Under ten – the number of fandoms in a category that have fewer than ten (1-9) stories. It is included to point out how many series can be considered a questionable venture. Communities are very fragile at conception, and any fandom that does not have a sufficient backbone in story total may not sustain itself.

It’s possible to provide additional counters like Under 100 or under 1000 on demand.

% of new – the share of new fandoms the media category received, relative to the total of new fandoms.

% of category – the increase of fandoms as a percentage of the Jan 1, 2010 fandom total


The biggest winner and loser are highlighted for you. This table reveals a lot about the health of each top category. While it is impossible to assess the sentiment in a particular series from the information above, the general moods that roamed in media categories throughout 2010 can be seen with ease. Anime/Manga remains the FFN heavyweight in terms of story count, more than 200,000 stories ahead of its closest rival, Books, but Books is the champion of fandom counts. It is an interesting phenomenon that Books, having more fandoms, has fewer stories than Anime/Manga.

Before we explain it, let’s have a look at the new fandom loser of 2010, Misc. In 2010, Misc had a marginal number of fandoms but five times more stories than its closest rival, Plays. Misc also had the lowest number of new fandoms, yet all of them grew to have more than 10 stories. In the researcher’s opinion, Misc, like Plays, should be considered anomalous. However, it’s difficult to put them aside as a separate category because they depict extreme trends that occur with extreme values.

Looking at the fandom total, Misc has only 35 fandoms as of January 1, 2010. Plays have almost three times as many, but fewer than 100. Comics are their closest companion, and Cartoons follow. Games are the middle child of fan fiction, with a jagged transition into the top categories: Anime, Books, Movies and TV. Although these are easy to categorise by the number of fandoms, there are two more perspectives visible in the table.

TV is the top dog of new fandoms: 365 appeared in 2010, and Books are closing the gap at 332. When it comes to attempts at discovering a new driving force, TV and Books take the cake. Games and Anime are middling, leaving the rest far behind. Now, lots of new fandoms are not necessarily a good thing because some of them can fail. TV has a lot of new fandoms, but also a lot of failing series. In fact, the number of fandoms with one story has doubled in TV relative to the category size. In absolute terms, it is clear as day: TV had 49 one-fic fandoms at the beginning of the year and 155 at the end. Talk about a crippling failure rating. The numbers can be even more frightening when you consider the possibilities of past movements in fandoms. A rise from, say, 20 to 49 is not as precipitous as what we see now.

6 out of 9 categories have more questionable fandoms in 2011 than they did in 2010. 755 Book communities have fewer than ten stories. That is more than half of the total number of fandoms in Books. The situation is similar in other categories that are large by story count, like Anime, TV and Movies, but not Games. Once again, Games position themselves in the middle.

By this point, it might be difficult to put all the numbers into one system, so a reference point is necessary. In 2010, the number of questionable fandoms (under 10 stories) rose by 40%, while the total number of fandoms rose by 30%. At the beginning of 2011, 2600 out of 5879 fandoms had under ten stories. Ergo, the site’s questionability rating is 44%. If 44% of fandoms are in questionable condition, 56% are not, and it might explain why the site is eager to accept more fandoms. Statistics show that a new request is more likely to succeed than not.

The trend may overturn soon, though. Questionable fandoms are taking up more server space as time passes. Since their number is rising quicker than the total number of fandoms, the series are spreading themselves thinly. How thinly? At the end of 2009, the questionability rating (fandoms under ten stories versus total fandoms) was 41.2%. At the end of 2010, this value is 44.2%. This means that the likelihood of a fandom growing has diminished, by a small margin but an important one. If FFN is a litmus test of fan fiction trends in the world, the questionability rating is a litmus test of series (books, TV shows, games) gaining creative support.
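The two questionability ratings can be approximated from figures already quoted in this post. The sketch below is our own check, not the original calculation: the end-of-2009 count of questionable fandoms is not published here, so it is backed out from the rounded "40%" rise, which is an assumption.

```python
# Figures quoted in this post.
total_2011 = 5879          # fandoms on January 1, 2011
under_ten_2011 = 2600      # fandoms with fewer than ten stories
new_fandoms_2010 = 1368    # fandoms created during 2010
under_ten_growth = 0.40    # "risen by 40%", a rounded figure

# Rating at the beginning of 2011: questionable fandoms / all fandoms.
rating_2011 = 100 * under_ten_2011 / total_2011

# Assumption: reconstruct the end-of-2009 counts from the rounded growth figure.
total_2009 = total_2011 - new_fandoms_2010
under_ten_2009 = under_ten_2011 / (1 + under_ten_growth)
rating_2009 = 100 * under_ten_2009 / total_2009

print(f"end of 2010: {rating_2011:.1f}%")
print(f"end of 2009 (reconstructed): {rating_2009:.1f}%")
```

Because the 40% figure is rounded, the reconstructed 2009 value only needs to land near the published 41.2%, not match it exactly.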

Further illustrating the point, let’s have a look at Chart 1 with failure and questionability rating changes. This is important: the bars represent changes since the beginning of 2010, not the ratings themselves.


Categories in the chart are ranked from left to right by total story count from biggest to smallest.

Comics experienced the highest increase in failure rating, which means new fandoms in Comics were more prone to fail than in any other top category. But don’t let the percentage increase (45%) fool you, because we’re dealing with fandom numbers 8 and 19, not hundreds. This is where a small top category with 30k stories in total may skew perception.

Large categories should provide a more accurate display of sentiments in fandom. Failure ratings in them are increasing by more than 10%, while the increase in questionability is above 50%, with Games as the exception. If you were to draw a line from the tip of one bar (blue or yellow) to another, you’d notice a trend of sorts (a tilde or squiggle), with Games, once again, falling outside the pattern.

Trend or no trend, the categories that have a lot to offer in terms of variety and story count see an increase in questionability and failure. This increase weighs a lot more than any decrease in smaller categories, with only Games acting as a dampening agent. Misc and Plays did not have a dramatic increase in failure ratings partly because they had too few fandoms in 2010, i.e. they did not provide enough data for a feasible conclusion about dynamics.

But there still is the general outlook. Here’s a list of questionability ratings as a percentage of fandoms in the category as of January 1, 2011:
Anime – 39%; Book – 58%; Cartoons – 27%; Comics – 35%; Games – 39%; Movie – 49%; Plays – 52%; TV – 39%. Misc have 0.

These figures expose a fact that may not be up to date. Saying that, in general, 58% of fandoms in Books have not grown into double digits does not mean this applies to 58% of fandoms created in 2010. In some cases, it applies to more than 58% because the questionability ratings have, on average, increased. To turn the “some” into exact values, though, we need to find out exactly which series contributed to overall growth.

2010 has been a productive year for several new fandoms. Names like Inception or Sorcerer’s Apprentice, having arrived in the second half of 2010, should not surprise anyone. Since one of these happened to come out on top of the 2010 fandoms, the table below reflects the last six months of the year for context.


As you can see, TV shows dominate the table in fandom numbers (9), with movies coming second (7) and two books and two cartoons filling the remaining spots. Interestingly, a movie, not a TV show, took first place. The first two fandoms, Inception and Sherlock, leave any competition far behind. Inception appeared on FanFiction.Net on July 14th, and Sherlock on July 29th. Recalculated by the number of days on the site, Inception has a marginal lead (0.2 of a story). If the top two create a clear distinction on the list, the next nine fandoms form another group: fewer than 1000 but more than 100 stories, which welcomes one Book, Heroes of Olympus. The third group, fewer than 100 stories, ends with two Cartoons, and there are no Games in sight, even though they have stayed in the middle of lists so far. Anime also failed to make the cut.

For categories that did not make it in the top twenty, here is a short list with the top fandom and a list they would start being present in:
Anime – Togainu no Chi (41) – Top 30
Comics – Teenage Mutant Ninja Turtles (28) – Top 40
Games – Minecraft (14) – Top 50

Neither Plays nor Misc make it into such a list, being pushed beyond the first hundred. But what about averages, you might ask? Surely, some fandoms sit at the top, but among several hundred fandoms there might be a concentration issue, where one category takes a row of spots somewhere in the middle of the rank while everyone else is skewed. Such an observation sounds reasonable, so some descriptive statistics are in order. For obvious reasons, you won’t see the mode. If they’re not obvious, guess the most common story count of a new fandom. One. The number is two for Plays, but that category has shown odd results in other parts of the analysis, so it shouldn’t surprise. The median makes sense only for Movies and TV since their top fandoms shift the average a lot, but we dodge this by removing those top fandoms from the analysis (values in parentheses).

Anime – 4.2
Books – 3.8
Cartoons – 13.2
Comics – 4.4
Games – 3.5
Misc - …
Movies – 16.6 (8.3)
Plays – 1.5
TV – 23.9 (14.3)

In total, new fandoms have generated under 14,000 stories in 2010.

That concludes the part dedicated to new fandoms.

TOP FANDOMS

Now that new arrivals on FanFiction.Net have been analysed, part of the audience may be anxious about things more down to earth: the big players on FFN. Below, you have a top twenty table at two dates, the beginning of 2010 and of 2011, along with changes in rank, showing which fandoms have gotten more popular in 2010 compared to 2009. If the rank changes, the fandom is highlighted. When the number next to the fandom’s name is negative, the fandom has moved up (a move from 5th to 2nd is three places higher in the rank). Do not be alarmed that the sum of “+” does not equal the sum of “–“ as some fandoms that appear on the list were not in it before.


The top four do not change throughout 2010. With hefty gaps greater than 50,000 stories, that is easy to explain. Commotion occurs in the middle of the list with certain fandoms jumping over others in rows. The quickest jumper on the list is Pokemon, which overtook five fandoms. Dragon Ball Z, on the other hand, is the biggest loser, dropping five places. While Pokemon is a living franchise that may create any number of games, movies and anime, Dragon Ball Z has a negative outlook. Its abrupt drop through the leaderboard looks even worse because no new content is being released.

Another shift worth inspecting is Supernatural vs Buffy the Vampire Slayer. As of January, 2011, Supernatural, not Buffy, has the #1 in TV shows. The latter has been a long-standing leader in that category due to little activity in other fandoms. In fact, the vampire series has been on FFN since 1998, seven years longer than Supernatural. However, Buffy’s future is stable because there isn’t any TV show able to take its place in the nearest future.

Kingdom Hearts and Yu-Gi-Oh had an odd change in pacing, with the first gaining almost 10,000 stories and the latter only 4,000. While dedicated fans know more about activity in Kingdom Hearts being factored by new content, the side perspective is that Kingdom Hearts’ forums were decimated on FFN on November 25, 2010, the flagship forum losing 8 out of 10 posts out of more than 500,000 present. Apparently, forums and story content do not correlate well in that fandom.

The end of the list has two newcomers responsible for pushing out CSI, another long-timer, present since 2001. Avatar: The Last Airbender and Death Note took its place. Both have a positive outlook, considering Sailor Moon and Dragon Ball Z are within range. In fact, these two, especially Avatar, are a threat to Teen Titans, which somehow managed to keep its spot as #18. On January 1, 2011, fewer than 400 stories separated it from Avatar. A surge of activity in Avatar is expected in the fourth quarter of 2011 when an addition to the series is scheduled.

It is improbable that fandoms which emerged in 2010 are going to appear on the list in 2011. The main candidate, Inception, was a movie and is not likely to gain enough momentum to overtake even Death Note, which has over 26,000 stories compared to Inception’s fewer than 2,000.

Likewise, the possibility for an unlisted fandom to appear on the top 20 list is slim. Fandoms that used to be in the top 20, but were pushed down throughout the years, had few truly large competitors. In any case, they would have to overcome outsiders like CSI and Star Wars.

In total, top fandoms generated 230k new stories in 2010. As of January 1, 2011, top fandoms listed a sum of 1.64 million stories, almost half of all stories currently present on FanFiction.Net. The top 20 list also contains 25% of stories ever posted on the site. New fandoms brought 0.7% of this value in 2010.

INACTIVE FANDOMS

You have encountered tiny fandoms that were not likely to gain any new stories due to their size. We referred to them as “failed” (1 story) and “questionable” (less than 10 stories), but these were a projection into the future. There are fandoms on FanFiction.Net, which have not gotten a single new story in the second half of 2010 (or have gotten some stories, but administrative/other deletions brought the number back to the level of Jan 1, 2010, creating a zero sum [period comparative with new fandoms]).

1814 fandoms did not receive a single new story over the last six months. This is more than the total number of new fandoms, 1368. Those 1814 fandoms contain approximately 30,291 stories.

Results are negative for all but one top-tier category. There were more fandoms being idle in every category than becoming active in 2010. In case of Misc, its inability to receive new fandoms throughout 2010 compensates the small ratio, with Anime X-overs being the only idle fandom. The situation may worsen for all Misc fandoms based on X-overs because they were created before the site established non-section crossovers, so they could be placed in the relevant fandom instead of Misc. It causes duplication of resources, but it is not as stunning as a category someone hacked on FFN (spare image).

The list you see below could have had greater inactivity values, specifically, for Plays, but a lower story count provides a completely different situation…decaying, perhaps. When the biggest series in the category (RENT) gets barely 30 extra stories in a year despite 400 being posted, conclusions get colourful.


On a lighter note, TV offers promising activity. The difference between new fandoms and inactive ones is practically non-existent, while other top categories display nearly identical ratios close to the site’s average. It applies in Books, Cartoons, Movies (and Plays). Games, along with Anime appear in a separate group with a high idleness rate. This correlates with one reader’s opinion that the site made a mistake by trusting Anime fandoms to generate its volume in the past year. Indeed, it has the most bulk, but uses a try and try again notion that had below average results in 2010.

CONCLUSION

Let’s recap and make a graphic comparison of all top tier categories. In practice, the table below is a ranked connection of the tables used above with a summary rating in the final column. The lower the value, the better fandoms in that category fared compared to others.


Given the criteria you see, TV was the healthiest top category of all in 2010, and if you want a sustainable experience in fandom, choose TV. Books is a healthy alternative, followed by Anime, Movies and Games. In fact, Games gives you the most average experience on the entire site. It’s not exceptional by any criterion, but it steers clear of any negative ratings and risks.

Cartoons and Misc may surprise you in some ways, but don’t expect much activity or exceptional review counts if you post a story there. And if you’re a really hardcore fan, 2010 offered writership like no other in Comics and Plays. When your life makes too much sense, write in the Plays category. Who knows what awaits you in a category that’s shrinking.

DATA RECAP

FanFiction.Net has 5879 fandoms, 3,744,842 stories.
The average fandom has 637 stories.
The median fandom has 14 stories.
1368 fandoms were created in 2010.
1814 fandoms did not grow in the second half of 2010.
2600 fandoms have under 10 stories.
793 fandoms have one story.
One top category shrank.
One fandom was hacked.

Largest fandom – Harry Potter.
Largest new fandom – Inception.

Fandom lists will be available shortly.

End Notes

This is only a part of the data cache collected for FFN Research, but getting it together into readable shape does take a while. Please, show your support to this research blog, so it wouldn’t die like some of the fandoms described above.

Friday, 1 October 2010

Erased Accounts

We're not idle here, don't worry. Since FanFiction.Net has been glitchy as of late, it was next to impossible to publish a list of purged user accounts. Likewise, it was difficult to create a list of good stories with the most objective criteria available.

As of today, the pending analysis of good fan fictions has a 90% confidence level, which is insufficient for further group analysis. No conclusions are presented from this research to prevent erroneous assumptions.

However, we have a static list of accounts deleted in the years 1998 (since October), 1999, 2000 and the first half of the year 2001. Here is the list. 4700 user accounts were purged in this term. It is an accurate list derived from observation for those dates with 98.7% of all accounts from ID 1 to ID 80,000 checked. It is not suggested that you make site-wide conclusions for the current situation, as the domain's growth and guideline changes ensued in 2002 and 2004, which created conditions that would render the numbers attained for the first 80k inapplicable for later years.

The 0.85% attained via our earlier sample remains the accurate figure for deleted accounts. Yes, approximately 1 out of 100 accounts is deleted on FFN. If you have 1000 favourite authors, 8 or 9 of them will cease to exist due to infringement. Should you contest this number, the accounts in our random sample are provided in a separate file (like in the previous post).

Sunday, 18 July 2010

FanFiction.Net Member Statistics

The research team is proud to present the first numbers from our user-related queries. This post answers many questions, including the following:

-How many writers are there on FFN?
-How long will you stay on FFN?
-How many stories do they write?
-How many users are deleted from FFN for infringement of ToS?
-How quickly does FFN grow?
-How many readers should you expect for a story?

First, we must present the methodology, though. The study consisted of generating 1100 random user account IDs ranging from 1 to 2,400,000 (source data at the bottom). This allows us to generate representative, unbiased results at a 95.34% confidence level with a 3% error margin. The list was generated on the 29th of June 2010. We have therefore included all accounts that were registered, enabled and fully functional, without restrictions on story creation or profile/review posting.
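The post does not state how the confidence level and error margin were derived. One common reading, shown below as a sketch under our own assumptions, is the normal-approximation margin of error for a proportion under simple random sampling, using the worst-case proportion of 0.5 and no finite-population correction. The function name `margin_of_error` is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def margin_of_error(sample_size, confidence, proportion=0.5):
    """Normal-approximation margin of error for a sampled proportion.

    Assumes simple random sampling; proportion=0.5 is the worst case.
    """
    # Two-sided critical value for the requested confidence level.
    z = NormalDist().inv_cdf((1 + confidence) / 2)
    return z * sqrt(proportion * (1 - proportion) / sample_size)


# Figures from the methodology paragraph: 1100 IDs, 95.34% confidence.
margin = margin_of_error(1100, 0.9534)
print(f"margin of error: {100 * margin:.1f}%")
```

Under these assumptions the pair "1100 accounts, 95.34% confidence" does correspond to a margin of about 3%.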

Now, the definitions. You will see the following criteria used in this post:

Empty account: any account that does not host stories uploaded by the owner. In layman terms, there are no stories posted in this account. There may be favourites. Here and here are examples of accounts dubbed 'empty'. Conversely, this is not an empty account.

Active/alive account: any account that has shown signs of life in the past six months, from January 1, 2010. This may be any of the following: updating or posting a story OR updating the profile OR adding a favourite story OR reviewing a favourite story in the past six months. For example, these two accounts are called 'active' or 'alive' in this post. In the case of the second example, please check the favourites. As long as at least one criterion is met, the account is active. Those who joined fan fiction in the year 2010 are active by default due to a professional grace period to create a story.

Inactive/dead account: any account that does not meet the active/alive criterion above. Here are two examples.

Deleted account: any user ID that shows the following or similar message "User does not exist or is no longer an active member."

Main Part

You probably recall that FFN has ~3,300,000 stories from our last research (number rounded up to accommodate growth since the previous post), which is 53% of all posted material, with the other 47% deleted. Keep this in mind for a moment.

In the sample of 1100, we discovered 742 empty accounts, which means that, by representativity, only 32.5% of all FanFiction.Net users have stories posted. How does that transfer into general numbers? In a population of 2,400,000 members, 781,000 have stories (4.2 stories per account that has stories, or 1.375 stories per member), while the remaining 1,619,000 do not participate in adding content. Two thirds of all members are pure readers, or so it may seem. If that were correct, we could say that 1 writer has 3 dedicated readers on average, if we assume writers themselves read. However, it's not that simple.

Some accounts are plain dead. How many? In the sample of 1100, 855 accounts were inactive and showed no signs of life in the year 2010. What does that mean for FFN? 78% of all accounts on FanFiction.Net are dead. Less than a fourth of accounts, 22% or 528,000, are currently at your disposal, which is fewer than the number of accounts with stories on them.

The fun part begins now. How many writers are active? Who could you expect updates from? We combine the overlapping criteria of 'active' and 'not empty'. In the sample of 1100, 130 accounts showed signs of life and had stories. This translates into: 12% of all accounts on FFN have at least one published story and are actively engaged in fandom activity. 88% of members on FFN are currently not shaping any fandom. As for those who do, there are 283,000 of them. We have found out that there are 5259 fandoms on FFN, which would mean 54 people keep a fandom alive in the course of 6 months.

On average, no more than 54 people appear in a fandom over six months. How many new people is that per day? 0.3 of a person drops into an average fandom. An average fandom has 681 stories. A median fandom, the one in the middle, which ditches the enormous influence of HP with 0.5 million stories, has 16. That was a bit of extra information, and we now return to users.
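The extrapolations above all scale a count from the 1100-account sample up to the 2,400,000-account population. A minimal sketch of that arithmetic follows; the helper `scale` and the choice of 181 days for "six months" (January 1 to June 30, 2010) are our assumptions.

```python
SAMPLE_SIZE = 1100
POPULATION = 2_400_000
FANDOMS = 5259
DAYS = 181  # assumption: January 1 to June 30, 2010


def scale(sample_count):
    """Extrapolate a count in the sample to the whole population."""
    return sample_count / SAMPLE_SIZE * POPULATION


# 742 of 1100 sampled accounts were empty, so the rest have stories.
accounts_with_stories = scale(SAMPLE_SIZE - 742)

# 130 sampled accounts were both active and not empty.
active_writers = scale(130)
writers_per_fandom = active_writers / FANDOMS
newcomers_per_day = writers_per_fandom / DAYS

print(f"accounts with stories: {accounts_with_stories:,.0f}")
print(f"active writers: {active_writers:,.0f}")
print(f"active writers per fandom: {writers_per_fandom:.0f}")
print(f"new people per fandom per day: {newcomers_per_day:.1f}")
```

The results agree with the figures in the text once rounding is taken into account.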

One aspect of FFN particularly interested the research team: the number of account deletions by the administrator. 0.73% was the number we acquired. That's less than 1 in 100. However, let us convert that into raw numbers: 17,500. We add an arbitrary 3000 to that number because accounts from 1 to 3000 are unavailable, and the account number generator did not account for it. What do we get? Since September 1998, FanFiction.Net has deleted over 20,500 users for infringement. That stands for 0.85% of all users. 4.75 accounts are deleted per day on average, a very modest number because we disregard deletions that are impossible to document and test easily, like those attributed to policy changes (for instance, when MSTs were deemed unwelcome).
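The deletion figures can be retraced step by step. The sketch below is our reconstruction: the exact start date is not given beyond "September 1998", so September 1, 1998 is an assumption, and the end date is taken to be the sampling date of June 29, 2010.

```python
from datetime import date

POPULATION = 2_400_000
deleted_share = 0.0073     # 0.73% of sampled accounts were deleted
unreachable_ids = 3000     # IDs 1 to 3000, added as deleted in the text

# Deleted accounts within the sampled ID range, then the adjusted total.
deleted_in_range = deleted_share * POPULATION
deleted_total = deleted_in_range + unreachable_ids
share_total = 100 * deleted_total / POPULATION

# Assumption: count days from September 1, 1998 to the sampling date.
days = (date(2010, 6, 29) - date(1998, 9, 1)).days
per_day = deleted_total / days

print(f"deleted accounts: {deleted_total:,.0f}")
print(f"share of all accounts: {share_total:.2f}%")
print(f"deletions per day: {per_day:.2f}")
```

A different assumed start day within September 1998 shifts the daily rate only in the second decimal place.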

Who would that be? Blacklisted people: spammers, trolls, plagiarists, other infringers. They missed a few trying to use FFN as an advertising venue here and here.

By now, you already know how many accounts there are in total. It's time to break them into a time series and give you an understanding of how quickly FFN grows.

A table below tackles this issue. We need to explain the columns for complete clarity:

Total: the last account ID created in the year, i.e. the cumulative number of accounts created up to December 31 (accounts made this year plus all accounts made in previous years)
Change: the number of accounts created in the year in question
Growth%: how many accounts FFN gained during the year, as a percentage of the previous year's Total
CChange: chained value of change. The ratio of Growth% (this year to last) divided by the ratio of Total (this year to last). It answers how much quicker (above 1) or slower (below 1) the site grew this year in comparison with the previous one, i.e. acceleration.
Middle: the date when half of the annual growth is reached, i.e. 50% of the accounts created in that year are already present by this date.

Year - Total - Change - Growth% - CChange - Middle
1999* - 6,749 - ... - ... - ... - ...
2000 - 33,090 - 26,620 - 411.4 - ... - ...
2001 - 147,200 - 114,110 - 344.8 - 0.19 - ...
2002 - 318,900 - 171,700 - 116.6 - 0.16 - ...
2003 - 512,000 - 193,100 - 60.6 - 0.32 - June 22
2004 - 733,000 - 221,000 - 43.2 - 0.5 - June 13
2005 - 959,000 - 226,000 - 30.8 - 0.55 - June 29
2006 - 1,188,200 - 229,200 - 23.9 - 0.63 - June 21
2007 - 1,458,900 - 270,700 - 22.8 - 0.78 - June 17
2008 - 1,788,000 - 329,100 - 22.6 - 0.81 - June 3
2009 - 2,238,000 - 450,000 - 25.2 - 0.89 - May 31
2010** - 2,680,000 - 442,000 - 19.8 - 0.66 - July 21

*Accounts created in 1998 added. It is impossible to tell when exactly a person joined before 2000-01-07.
**estimated, based on the first 6 months.
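For readers who want to recompute the derived columns, here is a sketch in Python. It rests on our reading of the table rather than on a formula stated by the authors: Growth% is taken as Change divided by the previous year's Total, and the CChange values in the table are reproduced when the ratio of Growth% (this year to last) is divided by the ratio of Total (this year to last). The 2010 row uses the post's own estimate.

```python
# (year, Total, Change) rows copied from the table above.
rows = [
    (2000, 33_090, 26_620),
    (2001, 147_200, 114_110),
    (2002, 318_900, 171_700),
    (2003, 512_000, 193_100),
    (2004, 733_000, 221_000),
    (2005, 959_000, 226_000),
    (2006, 1_188_200, 229_200),
    (2007, 1_458_900, 270_700),
    (2008, 1_788_000, 329_100),
    (2009, 2_238_000, 450_000),
    (2010, 2_680_000, 442_000),  # estimate from the first six months
]

# Growth as a fraction: accounts gained this year / previous year's Total.
growth = {}
for (_, prev_total, _), (year, _, change) in zip(rows, rows[1:]):
    growth[year] = change / prev_total

# CChange (our reading): growth ratio divided by total ratio.
cchange = {}
for (prev_year, prev_total, _), (year, total, _) in zip(rows, rows[1:]):
    if prev_year in growth:
        cchange[year] = (growth[year] / growth[prev_year]) / (total / prev_total)

for year in sorted(cchange):
    print(year, f"{100 * growth[year]:.1f}", f"{cchange[year]:.2f}")
```

The loop starts at 2002 because the 2000 row's Growth% cannot be recomputed consistently from the 1999 total shown in the table.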

Before we begin analysing the data, here is an explanation of our 2010 estimate. We calculated it according to seasons, not as a plain average. Based on our calculations, by June 21 the site receives 50% of its annual account growth. This means that slightly more accounts are created in the first half of the year than in the next six months. Site-wide, there is no reason to assume 'big' events like the release of a movie or a new popular book create significant fluctuations. Years before 2002 were not included due to volatility while the site was still young.

Now, let's carry on with the examination. As you can see in the Total column, the site is growing every year. Rational. The Change column shows that an increasing number of people join the site up to 2010, with the period from 2004 till 2006 being stable in terms of Change. Things become trickier with Growth% and CChange. Some of you may be confused why a site which is growing more and more in raw numbers seems to score poorly in the last two columns. The explanation is as follows: as the site grows, it needs a larger number of new accounts to sustain the same rate. A simple example: a site with 1000 accounts made in the previous year gets 1000 more this year, so next year it starts with 2000 accounts. If the site gains another 1000 next year, this 1000 will be relatively smaller (50% vs 100%) than the first. The same is happening to FFN, as it gains a similar number of accounts that weigh less and less.

The rate of acceleration or slowing down is most visible in CChange. Not a single value is higher than 1, which means the site never grew faster than the year before. Conversely, the closer the value is to zero, the less momentum the site gains compared to last year. From 2000 till 2009, the value was moving closer to 1, a sustainable equilibrium point, but the year 2010 returns us to the levels of 2006.

In layman terms, imagine two speeding cars. One of them is the site, and the other is 1, how the site did last year. The other car is a ghost/time challenge type that repeats the race as it was before. The ghost reaches the finish line first every time because your car never reaches the value of 1. You lose one race. Next time, the ghost repeats how you raced the time you lost. And again. Meaning, every race the ghost is slower, repeating your losses. You keep losing, though. While you do, you notice that if at first you lost by a long shot, after several runs, you still lose, but 1 is a lot closer.

If it weren't for 2010, a great gap in a seemingly fluent continuity, we could have made an obvious conclusion that FFN will, eventually, grow faster, and its growth will be bigger both in volume and ratio that volume takes in the whole (your car will start a winning streak).

Regression analysis showed that there is a polynomial relationship between time and growth. Linearly, there is a positive relationship, and a linear trendline (R^2=0.825) would claim that the site will reach CChange=1 in 2012.

A polynomial trend fits better, with R^2=0.9 for the parabola. It means that the function you will see below 'catches' 90% of all the variation in our growth measure (CChange) and best describes fluctuations in growth on FFN. In other words, 90% of all growth fluctuations are explained by time in the function below.

y = -0.0094x^2 + 0.218x - 0.4813

y - CChange value

x - number of years since 1998 (0, 1, 2, et cetera)

Basically, this function allows us to calculate the future of FFN. What is it? Well, according to this, the CChange value will be 0 when the site reaches 21 years of age, or by the year 2019. This is the scenario we follow if the site does not gain momentum by 2012. If we employed descriptive statistics, any CChange above 0.779 or under 0.3 would have been considered anomalous (the rule of three standard errors). Removing those values gives us a more pessimistic, yet less accurate, picture of these events. Reaching 1 would take three years longer linearly, and negative CChange would also be reached sooner in the more reliable polynomial models. Our choice of extrapolation is based on the principle of numeric accuracy, provided other factors remain static. Surely, clever website management and an increased interest in fan fiction as a concept are bound to change the end result. It does, however, suggest that site administration would want to avoid the trend described in this exercise.
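The year quoted above can be recovered by solving the fitted parabola for CChange = 0. The following Python sketch uses only the published coefficients; the names `cchange_fit`, `late_root` and `peak_value` are ours, and the result is an extrapolation of a curve fit, not a forecast.

```python
from math import sqrt

# Coefficients of the fitted parabola y = A*x^2 + B*x + C,
# where x is the number of years since 1998.
A, B, C = -0.0094, 0.218, -0.4813
BASE_YEAR = 1998


def cchange_fit(x):
    """Fitted CChange value x years after 1998."""
    return A * x * x + B * x + C


# Roots of the quadratic: where the fitted CChange crosses zero.
disc = B * B - 4 * A * C
early_root, late_root = sorted(
    ((-B + sqrt(disc)) / (2 * A), (-B - sqrt(disc)) / (2 * A))
)

# Vertex of the parabola: the highest CChange the fit ever predicts.
peak_x = -B / (2 * A)
peak_value = cchange_fit(peak_x)

print(f"fit crosses zero at x = {late_root:.1f}, "
      f"around {BASE_YEAR + round(late_root)}")
print(f"fit peaks at x = {peak_x:.1f} with CChange = {peak_value:.2f}")
```

With these coefficients the later zero crossing falls at about 21 years, and the curve peaks at roughly 0.78, so this particular fit never reaches CChange = 1.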

As a final part of this piece of research, we would like to address a number we have shown you before: 12%, the share of accounts that have stories posted and currently participate in fandom. Another 10% are active readers who do not have any stories posted. This is a general number, though, and we are sure you are more curious to know where you stand among your peers than across the whole site.

Below is a table with the following columns:
Year: the year of joining.
Full: the probability (%) that your account is still active and has stories, if you joined in the designated year.
Empty: the probability (%) that your account is still active but has no stories, if you joined in the designated year.
Full stays: the probability (%) that you have stories posted, given that you have stayed until July 2010.

We start from the year 2002, when initial FFN volatility abated. Empty in 2010 is skipped.

Year - Full - Empty - Full stays
2002 - 6.4 - 2.5 - 71.4
2003 - 8.5 - 1.1 - 88.9
2004 - 3.7 - 1.9 - 66.7
2005 - 5.7 - 2.3 - 71.4
2006 - 9.1 - 2.0 - 81.8
2007 - 9.1 - 5.8 - 61.1
2008 - 16.2 - 2.8 - 85.2
2009 - 18.6 - 21.3 - 46.6
2010 - 28.4

Interestingly, you are more likely to stay over a year on FFN if you have stories and are a writer than if you were just a reader. However, you have an equal chance of staying on FFN for a year, writer or reader alike. Regardless, if you join FFN, chances are you will not write a story and you will not be on the site longer than six months.

Even if you have written a story, it is most probable that you will not be on the site longer than six months. This is a generous time period, and it could be that six months appears as the most probable activity lifespan simply because it is the starting point: no shorter period is measured in this part.

We have worked on regression to give you an easy way to calculate the prospects of staying on FFN. A fifth-degree polynomial function seemed to have the biggest R^2=0.99. Amusingly, the probability would go down to negative 1700% very quickly after 8 years, so we had to switch to a simpler parabolic function with R^2=0.96.

y = 0.0218x^2 - 0.2603x + 0.961

x - the number of years you have/are intending to stay on FFN. (Works for values up to 10 years).

y - % that you will stay.

According to the given function, it is least likely that you will stay on FFN for 6 years. Thus, yes, it is more likely that you will stay 7 or 8. We attribute this to some form of fandom patriotism the earliest members have expressed to the site. A more precise function would have to include account deletions, which should, in reality, lower active account rates (remember the 3000 first accounts?) and the possibility of staying much longer than 8 years. In any case, the function above is presented for your amusement. A more informative variant is below.
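If you would like to play with the parabola above, the following Python sketch evaluates it for 0 to 10 years and locates its lowest point. The names are ours, and we assume y is expressed as a fraction of 1 (so 0.961 means 96.1%), which is how the constant term reads.

```python
# Coefficients of the parabola quoted in the post.
A, B, C = 0.0218, -0.2603, 0.961

def stay(x: float) -> float:
    """Fitted chance of staying x years (assumed to be a fraction of 1)."""
    return A * x * x + B * x + C

# The vertex of an upward-opening parabola is its minimum.
vertex_x = -B / (2 * A)

# Evaluate whole years within the range the post says the function covers.
values = {x: stay(x) for x in range(0, 11)}
least_likely = min(values, key=values.get)

for years, chance in values.items():
    print(f"{years:2d} years: {chance:.3f}")
print(f"Minimum near x = {vertex_x:.2f}; least likely whole year: {least_likely}")
```

The minimum lands just under 6 years, which is why the curve turns upwards again for 7 and 8.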

We understand that it might be difficult to imagine the contextual difference between 6% and 9% dominant in the previous table. For this reason, we have made a coefficient, so 28.4%=1. This way, you will see more clearly how many active accounts die away, and how many stay active.

8 years 23%
7 years 30%
6 years 13%
5 years 20%
4 years 32%
3 years 32%
2 years 57%
1 year 65%
0 years 100%

The process can be taken further if you want to see what percentage of the 65%, et cetera, die away in the following years.
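The coefficient is easy to reproduce. The sketch below divides each year's 'Full' value by the 2010 value of 28.4 and rounds to whole percent; the dictionary names and the 2010 snapshot year used to turn a joining year into "years on the site" are our own labels. The last step is one possible reading of taking the process further.

```python
# 'Full' column from the table above: % of accounts still active with stories.
full = {2002: 6.4, 2003: 8.5, 2004: 3.7, 2005: 5.7, 2006: 9.1,
        2007: 9.1, 2008: 16.2, 2009: 18.6, 2010: 28.4}

SNAPSHOT_YEAR = 2010              # the data was collected in 2010
baseline = full[SNAPSHOT_YEAR]    # 28.4% is treated as 1 (100%)

# Years on the site -> share of the baseline that is still active, in %.
relative = {SNAPSHOT_YEAR - year: round(100 * pct / baseline)
            for year, pct in full.items()}

for years in sorted(relative, reverse=True):
    print(f"{years} years {relative[years]}%")

# Taking it further: the share of each group that is still there a year later.
# Values above 1 mark the years where the trend breaks (7 and 8 years).
retained_next = {n: relative[n] / relative[n - 1] for n in range(1, 9)}
```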

Active fanfic-participating accounts (those that make up 12% of the site, remember that) lose 35% of their numbers in the first year. The second most rapid drop comes at 3 years, but people who stay 3 years are prone to staying 4. The last accurate data point that agrees with the trend (the more time passes, the fewer people stay) is 6 years. Only 1/8 of the people who are active writers right after joining remain this way; 7/8 chip off during the trip. As such, the number of permanent contributors (who stay on the site for years) increases as FFN grows. There is only one 'but': the increase is largely consumed by users abandoning their accounts.

Those who have spent less than 6 months on the site account for 6.5% (29.5%) of the 22% of people who are active in any way. Another 7.3% (33.1%) come from those who have spent more than a year. As such, it is reasonable to say that almost two thirds of the site is actively inhabited by inexperienced account owners, rated 'fans' in forums. So-called 'fanatics' make up a third of the active population, a third that spans from 1998 to the beginning of 2009. On the one hand, it is peculiar that the number of active newbies (writers or just readers) is almost equal to that of 'fanatics'. On the other, it should make quality control out of the question. Why does it not even out? A question we leave in your hands, dear readers.

Conclusion

Unless FFN manages to speed up its growth potential, those 12% that currently shape the fandom will not be enough, especially because ~5 accounts are deleted every day. The site needs to replace more than 35% of active users every year, and 2010 so far looks the most challenging yet. More dedication, fellow fans. May the concept of fan fiction prosper.

Added: here is a list of user accounts in our sample.

Question: What about people who just go to forums, aren't they active?
Answer: They do not make use of the site's core service as a fan fiction archive. If you don't write or read stories, you are considered inactive. The only way a forum goer could be included as active (provided they have no stories or favourites) is if they updated their profile this year.

Wednesday, 7 July 2010

Most Popular Categories

This post aims to clear up confusion about what is popular on FanFiction.Net, what is not, and why. All statements you will see in our report are based on raw point data, collected on 29th June, 2010. This means that everything has been taken by machine directly from the site by the method known as observation, not from a sample that omits fandoms. (It was scooped by looking at the numbers, for the younger readers.)

As such, the presented data has a 100% confidence level. However, we understand the value of server delays and are including an arbitrary 3% error margin because the data is taken from the top-category view, not by trawling inside every fandom. If you recall from the previous post, top-category views show a slightly bigger number of stories (up to 5% for some fandoms) for active fandoms (with over 50 stories), so this is included due to FFN site dynamics as a precaution, despite the inclusion making no difference statistically. This is made necessary further because a part of our target audience has not passed a statistics course.

To make this more interesting, we suggest that everyone take a guess at which top category is the most popular: Anime/Manga, Books, Games, TV shows et cetera, from the list of 10 on the front page.

Depending on your age, the answer is probably 'Games' or 'Books'. That answer would not be far from the truth, but not even Harry Potter, the biggest fandom on the site, gives Books the top spot. Nor is it the combination of Kingdom Hearts, Pokemon and other games, all with the biggest forums on FFN.

The largest top-level category on FanFiction.Net is Anime/Manga, with 1,062,835 publicly available stories. It also has numerous subcategories/fandoms: 955. Despite this, a third of them (over 300) have under 10 stories, with the bulk concentrated in Naruto (240,635) and Inuyasha (93,196).

If you recall, FFN has approximately 3,200,000 live stories, which means every third story found on FFN is related to Anime/Manga. Why? We can't answer that just yet, just as we can't tell you why only 1 in 50 writers becomes a Beta Reader.

Anime/Manga takes #1. Now, for all Harry Potter, Twilight, LotR, Warriors, PJO et cetera fandoms, you are not unimportant. Books hold a firm #2 with 811,044 live stories. Of these, 461,311 and 150,708 belong to Harry Potter and Twilight respectively. Let's dwell on these two for a moment. The HP fandom is slightly more than three times as big as Twilight. The underdog may claim this is because Harry Potter has been around a lot longer than Twilight. To make matters fair, that would mean Twilight would be as popular as Harry Potter if they were of the same age.

What's real and what's not? The first HP book was released in 1997, 13 years ago. The first Twilight book was released in 2005, 5 years ago. One might exclaim: "Ah hah! Eight years is a very small time, so they must be equally popular!" Let's do the math. HP is 2.6 times as old as Twilight. Had they been released at the same time, Twilight would now have about 391,000 stories, 70,000 behind HP. How big/small of a difference is that? That's almost two LotR fandoms and the total number of new books released in Spain annually.
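The arithmetic behind this comparison fits in a few lines of Python. The sketch assumes, as the paragraph does, that story counts grow in simple proportion to a fandom's age; the variable names are ours.

```python
# Story counts and release years as given in the post (data from June 2010).
hp_stories = 461_311
twilight_stories = 150_708
hp_age = 2010 - 1997         # 13 years
twilight_age = 2010 - 2005   # 5 years

# How many times older HP is than Twilight.
age_ratio = hp_age / twilight_age

# Assumption: stories scale linearly with age, so give Twilight HP's head start.
twilight_scaled = twilight_stories * age_ratio
gap = hp_stories - twilight_scaled

print(f"Age ratio: {age_ratio:.1f}")
print(f"Twilight at HP's age: about {twilight_scaled:,.0f} stories")
print(f"Still behind HP by about {gap:,.0f} stories")
```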

We return to weights. If the largest fandom in Anime/Manga, Naruto, has 240,000 stories, a fourth of the Anime/Manga total, HP has 461,000, way over half of all Book-related fiction accessible on FFN. One may think that, seeing as Books are #2, more popular than Games, Comics and TV shows (even with some of them summed up), the world of fan fiction is into literature fandoms. False. Anime/Manga is popular because there are many fandoms. Books is big because there is a lot of HP. In layman's terms, it would be sensible to rename 'Books' to 'Harry Potter & Twilight', pop reads, which can only scratch the surface of, say, critically acclaimed classical literature. The audience on FFN could have been assumed to be an active participant in literature fandoms. It is, however, an active participant in HP and Twilight-level literature fandoms.

Moving on to #3, which is TV shows at 580,596. Curiously, their outlook is similar to that of Anime, with a third of all fandoms having under 10 stories, and there being multiple weight leaders. 15 fandoms take the range from 40k to 10k. In Anime, 20 fandoms have over 10k stories. In Books, 4 fandoms have more than 10,000 stories.

One may want to run an economic monopoly concentration index formula on these numbers. In case it is viable, we present the number of fandoms in the top categories.

Anime/Manga: 1,062,835 stories; 955 fandoms; 20 fandoms have above 10,000 stories
Books: 811,044 stories; 1138 fandoms; 4 fandoms have above 10,000 stories
TV: 580,596 stories; 1013 fandoms; 15 fandoms have above 10,000 stories

Additional research is suggested for the curious: remove all the popular fandoms that add substantial weight to the category (have above 10,000 stories) and make an account of how much dead weight, or unpopular fandoms, every top-level category has.

At the moment, numbers suggest that Anime/Manga is the healthiest category. Why? 1. It has more stories than the others. 2. It has fewer fandoms. 3. It has more popular fandoms than Books and TV combined. 4. It is least threatened by C&D (cease and desist) letters.

The fourth one is important. If a cease and desist letter were sent to, say, Harry Potter fans, forbidding them to write fan fiction, Books would drop dramatically from 811,044 to 349,733. If the same happened to an Anime/Manga fandom, the greatest loss it could suffer would be 240,635 stories, less than a quarter of its size. The same applies to TV.
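Here is a small Python sketch of that exposure check, using only the totals and top-fandom counts quoted in this post. TV is left out because the post does not give the story count of its top fandom; the names are ours.

```python
# Category total and size of its largest fandom, as quoted in the post.
categories = {
    "Anime/Manga": {"total": 1_062_835, "top_fandom": 240_635},  # Naruto
    "Books": {"total": 811_044, "top_fandom": 461_311},          # Harry Potter
}

exposure = {}
for name, c in categories.items():
    remaining = c["total"] - c["top_fandom"]   # stories left after a takedown
    share = c["top_fandom"] / c["total"]       # weight of the top fandom
    exposure[name] = {"remaining": remaining, "share": share}
    print(f"{name}: {remaining:,} stories left, top fandom is {share:.1%}")
```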

Let's not ignore other fandoms. Below is a limited table/list of top categories without crossovers. Crossovers were counted in our summary of all stories on FFN, but they produce confusing data in the way they are organised, belonging to several fandoms at the same time. The list below is made for clarity purposes.

Name - Story number - Fandom number - Fandoms above 10k - Top fandom

Anime: 1,062,835 - 955 - 20 - Naruto
Books: 811,044 - 1138 - 4 - Harry Potter
TV: 580,596 - 1013 - 15 - Buffy: The Vampire Slayer
Games: 269,261 - 614 - 6 - Kingdom Hearts
Cartoons: 192,918 - 320 - 5 - Teen Titans
Movies: 125,000 - 943 - 4 - Star Wars
Misc: 105,500 - 34 - 2 - Wrestling
Comics: 33,824 - 123 - 0 (8 above 1000) - X-Men
Plays: 15,300 - 85 - 0 (5 above 1000) - RENT
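A proper concentration index would need the story count of every single fandom, which this post does not list. As a rough stand-in, the Python sketch below computes two simpler figures from the table above: the average number of stories per fandom and the share of fandoms above 10,000 stories. Both measures, and all the names, are our own illustrative choices and not an established index.

```python
# (stories, fandoms, fandoms above 10k) per top category, from the table above.
table = {
    "Anime":    (1_062_835, 955, 20),
    "Books":    (811_044, 1138, 4),
    "TV":       (580_596, 1013, 15),
    "Games":    (269_261, 614, 6),
    "Cartoons": (192_918, 320, 5),
    "Movies":   (125_000, 943, 4),
    "Misc":     (105_500, 34, 2),
    "Comics":   (33_824, 123, 0),
    "Plays":    (15_300, 85, 0),
}

# Average stories per fandom: a crude measure of how crowded a category is.
avg_per_fandom = {name: stories / fandoms
                  for name, (stories, fandoms, _) in table.items()}

# Share of fandoms that count as popular (above 10,000 stories).
popular_share = {name: big / fandoms
                 for name, (_, fandoms, big) in table.items()}

for name in table:
    print(f"{name}: {avg_per_fandom[name]:,.0f} stories per fandom, "
          f"{popular_share[name]:.1%} of fandoms above 10k")
```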

The following information is suggested for further study: how many fandoms are uninhabited (have below 10 stories), how much is that divided by the number of fandoms in the category?

An explanation should follow for the last two categories, Comics and Plays. Plays, for example, have been included much later than other top categories, setting them aside. Also, since they do not exceed 100,000 stories and it is rational to use a proportion size, the active fandom definition is adapted to them as 1000.

As always, present your questions and suggest ideas. This blog is interactive, and we will cover your topics of interest. Coming up next: how many writers does FFN really have?

Tuesday, 6 July 2010

FanFiction.Net story totals

Good news!

Our research venture has completed gathering data about site-wide story numbers. This post explains how many stories FanFiction.Net (FFN) really has.

The data in our evaluations is based on the total number of stories posted as of June 25th, 2010. The gathered data was processed from June 25th to June 30th, 2010. We treat it as a spatial or point collection; 5 days = 1 instance.

At the time of collection, there was a total of 6,085,534 registered story entries, based on the newest registered story number in the Just In section on June 25th, 2010.[1]

However, we understand that some stories are deleted, and their ID number is not taken out from the database to be recycled for a new story.[2] Instead, the list carries on, and every newly posted story receives a number higher than the previous.

(An additional explanation for younger readers: you submit a story, and it gets an ID number in FFN's database, so everyone can easily find it. Let's say your ID is 123. If you know that, you can easily make a link without having to copy anything because all stories on FFN have http://www.fanfiction.net/s/*your story ID*. When someone posts a story right after you, their ID is 124, then 125, 126 and so on and so forth. Say the site got to story number 140, but story 128 has been deemed illegal because it was about living actors, and deleted by the FFN staff so nobody would sue. What do we have? We have numbers from 1 to 140, but 128 has been deleted. You can't know it has been deleted, by the way, because you're not the writer, and the only way to find out is to check. There are now 139 stories on the site even though it looks like there are 140. Thing is, on a site as big as FFN, you can't just guess how many numbers are 'blank' like that.)
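The same toy example, written as a short Python sketch. The URL pattern is the one given above; the function name and the set of deleted IDs are made up for illustration.

```python
BASE_URL = "http://www.fanfiction.net/s/"

def story_url(story_id: int) -> str:
    """Build a story link from its ID, following the pattern described above."""
    return f"{BASE_URL}{story_id}"

# Toy numbers from the explanation: IDs 1 to 140 were handed out, 128 was deleted.
highest_id = 140
deleted_ids = {128}   # hypothetical: on the real site this set is unknown

# The newest ID overstates the live count by the number of deleted stories.
live_stories = highest_id - len(deleted_ids)

print(story_url(123))
print(f"Newest ID: {highest_id}, stories actually live: {live_stories}")
```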

This is the main reason for this analysis: the number FanFiction.Net presents to you is not the total number of stories it has at the moment, but the sum of all fanworks that have ever been available to the public. The key term is 'available to the public' because FFN, according to their ToS, keeps server copies of user submissions. It is reasonable to assume that the real number of stories we can see now (dated June 25th, 2010) is not over 6 million.

We used two methods to reach the data. The first is making an account of all stories present in all ten top categories and crossovers such as this. Surely, it is a lot of very repetitive and dull labour, but it gives us the exact number, which is: 3,256,278 stories.

As of June 25th, 2010 there are 3,256,278 stories noted as accessible to the public on FanFiction.Net.

This is an accurate number, but it is not 100% of what the story number has been. Why? We made a top category account, without having to rummage through every single fandom, opening it like this. Why is this important? The number of stories in the top category window is always bigger than the real number of fictions one can browse inside the category (or equal to it, when the fandom is inactive [has fewer than 50 stories]). The researchers cannot provide you a firm answer on this discrepancy, but it may be attributed to stories being deleted at a slower rate than they are added (for example, if you upload a story by mistake and delete it, you raise the top category number of stories, and it stays above the real number even though you can no longer find the story: a server delay).

It has been determined that, depending on the fandom, the real number is from 0.19% to 5% smaller than the one provided. In large categories, the weight of which forces the researchers to consider them, this number teeters closer to the first value. Now, it might not seem substantial, but Twilight with its 150,000 uploads may have up to 2000 dead stories counted as alive every day. To be completely fair to the estimate, we are multiplying the number by an arbitrary 0.987 coefficient, which best describes the current number of stories, as seen in ten most popular, story-wise, subcategories of Books, Anime and others, except crossovers. Since they make up the trending bulk of FFN, their averages have been considered.

Here is a better estimate, statistically not different from the first, but more exact for the human eye: 3,213,946.

What does that say to you? FanFiction.Net is only 54%-53% (without/with the 0.987 coefficient) of what it appears to be to the layman, with the remaining 46%-47% being deleted content. As such, you may take it that every second story is destined to be deleted, and out of every two stories you post only one will survive (statistically).
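These figures can be reproduced with a few lines of Python. The sketch uses only the numbers quoted in this post; the variable names are ours.

```python
# Figures from the post, dated June 25th, 2010.
total_ids = 6_085_534        # newest story ID seen in Just In
counted_stories = 3_256_278  # sum over the top-category views
coefficient = 0.987          # the post's arbitrary correction for server delays

# Corrected estimate of stories that are really browsable.
adjusted_stories = round(counted_stories * coefficient)

# Share of all story IDs ever issued that are still live.
share_raw = counted_stories / total_ids
share_adjusted = adjusted_stories / total_ids

print(f"Adjusted estimate: {adjusted_stories:,}")
print(f"Live share: {share_raw:.0%} without, {share_adjusted:.0%} with the coefficient")
```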

What about the second method? Aside from these real numbers taken in raw, the research includes a sample of 1100 randomly generated story IDs in the range [1;6085534], which allows the research to continue with a case study at a 3% error margin and a 95.34% confidence level. The survivability estimate taken from the sample is 55%, which is within the 3% acceptable error and statistically identical to the 54%-53% received with the help of raw data. For future studies, this means our method of sampling follows the general population's characteristics.
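For those who wonder where a confidence level like that can come from, here is a hedged Python sketch. The post does not say how the figure was derived; we assume the usual normal approximation for a proportion with the worst case p = 0.5, which happens to reproduce it. All names are ours.

```python
import math

sample_size = 1100   # randomly generated story IDs
margin = 0.03        # the 3% error margin
p = 0.5              # assumption: worst-case proportion, widest possible margin

# Standard error of a sample proportion under the normal approximation.
standard_error = math.sqrt(p * (1 - p) / sample_size)

# How many standard errors fit into the chosen margin.
z = margin / standard_error

# Two-sided confidence level for that z: P(|Z| <= z) = erf(z / sqrt(2)).
confidence = math.erf(z / math.sqrt(2))

# Compare the sample estimate with the raw-data figures (54% and 53%).
sample_estimate = 0.55
within_margin = all(abs(sample_estimate - raw) <= margin for raw in (0.54, 0.53))

print(f"z = {z:.2f}, confidence level = {confidence:.2%}")
print(f"Sample estimate within the margin of both raw figures: {within_margin}")
```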

In conclusion, there are 3,213,946 stories on FanFiction.Net at the time of our study, and nearly half of all stories posted will sooner or later disappear. How soon? Come back later to find out!

Should you require additional data, requests can be made in the comments, emailed to Lord Kelvin or posted in the Literate Union forum. The list used in our sample can be found here:
http://www.usbupload.com/23228_FFNstatsdatadoc.usb
http://www.usbupload.com/23227_samplelinksFFNdoc.usb

Tuesday, 29 June 2010

Welcome!

This blog has been created to store all the data Lord Kelvin and other curious people banded under the FFN Research flag have collected or are collecting about FanFiction.Net. Expect analysis, various facts and queries in every informative post.

Finding hard data related to fandom and fan fiction has been...difficult as of late. Anything available online was either inaccurate, obsolete or cost $600 for a peek. This was unacceptable, so we took matters into our own hands. Our purpose is to become a reputable basis for every worthwhile query. By fans. For fans. We all hope our efforts and studies are going to inspire you, dear readers, to join this cause.

If you have any requests, post them in comments below. Alternatively, you may send feedback and ideas to Lord Kelvin.