Youth Soccer Rankings ?

Noticed a couple of teams/clubs doing this at Surf Cup.
A couple teams also applied to the tournament before they changed team names. Same team just under a prior name. Noticed this most with teams that went from EA to MLS II (which apparently is now being called MLS AD by some clubs?).
 
That's the same thing I said, no? Why are you arguing? Most teams play 80+% games at full length which weights the data for full length games. If it was possible to normalize per 90, then the ratings wouldn't shift, but we both agree there are so many other variables that it's not worth it and directionally it evens out over time.

There's definitely more agreement than disagreement.

Score one for the algorithm in general, but the result of this accurate prediction is that will be slightly less accurate for future predictions.

I was interpreting this as stating that since SR was including these tournament results and yet *not* accounting for game length differences, it's making the predictions less accurate - hurting stronger team ratings and helping the weaker team ratings.

It doesn't make SR or its predictions any less accurate or more accurate. SR is going to represent how a team performs - in all circumstances, whether those circumstances are completely identical, or with varying factors along the way. Its rating is going to be a prediction for how the team performs on its very next game - with no knowledge of what circumstances surround the next game. A team is going to get a rating, for the recorded soccer games it plays, regardless of circumstances.

A team that plays 80% full-length games, compared to a team that plays 80% short games, has a somewhat different environment to achieve its game scores, agreed. But neither team has an "advantage", or IMO certainly not an unfair/unjust advantage - their rating will represent the scores that they would expect to get in the very next game they play (equivalent to calculated performance of all of the games they've already played).
 
There is one issue with the ranking app that could be viewed as more of a feature depending on who you are.

The Ranking App weights games (gives extra credit) if teams playing each other have a similar ranking. What this means is if you're in a league with all very similar ranked teams the games you play against each other will have a higher value then normal. Usually leagues will have teams with a variance in ranking. Socal girls ECNL is a little different, it has a large number of highly ranked teams. What happens is because they play each other a lot in league the games turn into a feedback loop with extra credit on top.

Although I cant prove it. (actually I probabaly could but it would take going though old data) This is why in Playoffs Socal ECNL teams seemed like they were going to steamroll their way through the finals if you were just looking at the ranking app. But in many cases teams with a lower ranking somehow prevailed. Those teams were likely in a regional league with a variance of good and bad teams. During the season they were getting the wins but they werent getting the weighted / extra credit.

Adding weight to games of similarly ranked teams is likely being done by the ranking app so teams can't just win 10+-0 over and over against lower ranked teams to get a high ranking.
 
There is one issue with the ranking app that could be viewed as more of a feature depending on who you are.

The Ranking App weights games (gives extra credit) if teams playing each other have a similar ranking. What this means is if you're in a league with all very similar ranked teams the games you play against each other will have a higher value then normal. Usually leagues will have teams with a variance in ranking. Socal girls ECNL is a little different, it has a large number of highly ranked teams. What happens is because they play each other a lot in league the games turn into a feedback loop with extra credit on top.

Although I cant prove it. (actually I probabaly could but it would take going though old data) This is why in Playoffs Socal ECNL teams seemed like they were going to steamroll their way through the finals if you were just looking at the ranking app. But in many cases teams with a lower ranking somehow prevailed. Those teams were likely in a regional league with a variance of good and bad teams. During the season they were getting the wins but they werent getting the weighted / extra credit.

Adding weight to games of similarly ranked teams is likely being done by the ranking app so teams can't just win 10+-0 over and over against lower ranked teams to get a high ranking.

This is a misunderstanding and mischaracterization of how SR works, and how it rates teams. There is no extra credit for close games, there is no bonus for playing similar teams. Your belief about leagues with similar ratings compared to more diverse ratings being definitive in helping teams "achieve" a rating - is hooey for several reasons. One of them is that if you look at the delta between top teams and bottom teams, it's about as wide in the top level in the country, down to Flight 4. As long as teams in a bracket all have to play each other - there's going to be a relatively expected distribution of results.

There are plenty of parts of their algorithm to take account for limiting blowouts in affecting rating - so beating a team 10-1 is probably only a little better than 4-1, if at all. But none of this is due to "fairness", or to give a bonus to good teams, or a penalty to bad teams. The entirety of the algorithms is so as they are applied to the thousands and thousands of game results, the expected games chosen to win - goes up. Anything at all that can be done, in transforming the given data for game results, into predictions that result in the highest expected wins - is being done. It's the reason that game results of games that go to PKs are stored as ties, and not a win for the winning team. According to their data, the winner of a PK turns out to be not predictive at all about which team is actually more likely to win going forward. It's a coin flip - so it isn't used.

All of this is checked continuously and recursively, so if the incoming data starts to change a bit, they are tweaking the parameters to keep that expected win percentage as high as possible. We (and they) might have intuitive beliefs about how to weight the data, how much more a game 2 weeks ago is worth compared to 8 weeks ago, how much better a team that wins by a 2 goals compared to a goal, and we might even be on the right track. But it's even more effective to have all of that intuition, and guesses, and implement them on real data across thousands and thousands of games - to get instant results about whether the change will predict more wins or not.

That's essentially all SR does - everything else is UI and window dressing.

I know you don't believe me. But if you're actually curious about understanding how things work, talk to Mark. Talk to their team.
 
This is a misunderstanding and mischaracterization of how SR works, and how it rates teams. There is no extra credit for close games, there is no bonus for playing similar teams. Your belief about leagues with similar ratings compared to more diverse ratings being definitive in helping teams "achieve" a rating - is hooey for several reasons. One of them is that if you look at the delta between top teams and bottom teams, it's about as wide in the top level in the country, down to Flight 4. As long as teams in a bracket all have to play each other - there's going to be a relatively expected distribution of results.

There are plenty of parts of their algorithm to take account for limiting blowouts in affecting rating - so beating a team 10-1 is probably only a little better than 4-1, if at all. But none of this is due to "fairness", or to give a bonus to good teams, or a penalty to bad teams. The entirety of the algorithms is so as they are applied to the thousands and thousands of game results, the expected games chosen to win - goes up. Anything at all that can be done, in transforming the given data for game results, into predictions that result in the highest expected wins - is being done. It's the reason that game results of games that go to PKs are stored as ties, and not a win for the winning team. According to their data, the winner of a PK turns out to be not predictive at all about which team is actually more likely to win going forward. It's a coin flip - so it isn't used.

All of this is checked continuously and recursively, so if the incoming data starts to change a bit, they are tweaking the parameters to keep that expected win percentage as high as possible. We (and they) might have intuitive beliefs about how to weight the data, how much more a game 2 weeks ago is worth compared to 8 weeks ago, how much better a team that wins by a 2 goals compared to a goal, and we might even be on the right track. But it's even more effective to have all of that intuition, and guesses, and implement them on real data across thousands and thousands of games - to get instant results about whether the change will predict more wins or not.

That's essentially all SR does - everything else is UI and window dressing.

I know you don't believe me. But if you're actually curious about understanding how things work, talk to Mark. Talk to their team.
How about if I just quote from the ranking apps website...


"
Q: How are the team rankings calculated?

A: We pull from over 400 official sources in order to provide the most comprehensive results. We look at the goal difference in the last 20 game played in the last 12 months, including games against older teams. There is a line below the last result that counts towards your ranking. More recent games and games against a similar team are given higher weight. To improve your rating, you have to do better than predicted. Your rating will go down if you do worse than predicted.
 
Fair - but there is no "bonus" for playing similar teams, and a rating/ranking for a team isn't improved in any way by playing similar teams. It's just that the results coming from playing teams that are closer, are more predictive for how the team will perform in future games - so those results tend to be weighted more than both teams that are quite further apart, as well as game results that are further apart. In a hypothetical bracket where the top team and bottom team are only separated by a goal - it's not like the teams get any benefit for being in that bracket, other than their predicted results will implicitly be more accurate compared to a bracket that has a separation of 5 goals. Given the same number of games, they will be more predictive - not higher (or lower) rated. Any and all calculations made on the game data to create the ratings are in support of optimizing win percentage for the predictions (i.e. did the higher rated team beat the lower rated team) - and nothing else.

It's a completely different setup than one that is set up to award points/bonuses for tournament wins, game wins, etc.
 
Fair - but there is no "bonus" for playing similar teams, and a rating/ranking for a team isn't improved in any way by playing similar teams. It's just that the results coming from playing teams that are closer, are more predictive for how the team will perform in future games - so those results tend to be weighted more than both teams that are quite further apart, as well as game results that are further apart. In a hypothetical bracket where the top team and bottom team are only separated by a goal - it's not like the teams get any benefit for being in that bracket, other than their predicted results will implicitly be more accurate compared to a bracket that has a separation of 5 goals. Given the same number of games, they will be more predictive - not higher (or lower) rated. Any and all calculations made on the game data to create the ratings are in support of optimizing win percentage for the predictions (i.e. did the higher rated team beat the lower rated team) - and nothing else.

It's a completely different setup than one that is set up to award points/bonuses for tournament wins, game wins, etc.
A while back I posted that Pats Pre ECNL’s mercurial rise through the rankings looked odd. I suspected that Soccer ranking had an algorithm that give more weight to a team with ECNL or MLS in their team name. You chewed me up for it. But looks like this is it.
 
A while back I posted that Pats Pre ECNL’s mercurial rise through the rankings looked odd. I suspected that Soccer ranking had an algorithm that give more weight to a team with ECNL or MLS in their team name. You chewed me up for it. But looks like this is it.
The ranking app doesnt care about the name of the league. With Pats there is a little going on that does give them a boost. In Socal ECNL league theres several highly ranked teams. If they beat or tie these teams it will push them up in rankings assuming they get more than the expected number of goals. However, once Pats are highly ranked themselves the ranking app weights games with similar ranking over those that arent similar. What this means is ties against an equally highly ranked team will help to maintain a high ranking more than they normally would. This is the feedback loop I was talking about. It artificially keeps Socal teams ranked high because of the number of highly ranked teams they play every week in league. Just to be clear I'm not saying that they wouldn't be ranked highly if the ranking app didnt weight playing similar ranked teams. This isnt the case. I am saying that they're ranked a little higher than their ranking would be without weighting similar ranked games.

If the ranking app didnt weight games with a similar ranking more than those without a really good team could manipulate the ranking app by blowing out terrible teams 10+-1 over and over.

Teams in leagues that have a mix of ranked teams in their league get screwed a little in ranking. But this isnt always a bad thing because in playoffs or tournaments they'll be better than their ranking and likely seeded against not as good teams. (initially) Once they blow out a good team the ranking app will quickly adjust their ranking up because they scored several more than the expected number of goals.
 
It's the reason that game results of games that go to PKs are stored as ties, and not a win for the winning team. According to their data, the winner of a PK turns out to be not predictive at all about which team is actually more likely to win going forward. It's a coin flip - so it isn't used.

Yeah, we just lost a final on PKs to a highly ranked team and the app unfortunately recorded the 5-4 penalty score, not the 1-1 tie, and we also beat another highly ranked team in the group stage, but that result ended up in the Unranked section due to a slight name change for the opposition, so we didn't gain anything in the app from either result.

That said, while I find the Ranking App rankings a fun distraction, it's just a ballpark guide, particularly for tournaments where inter-league play is in effect. The real value is in looking at the previous results with an educated eye.

A useful tweak might be to also offer a ranking for your team's current league, though I guess we have that already and it's called the League Table, LOL.
 
Yeah, we just lost a final on PKs to a highly ranked team and the app unfortunately recorded the 5-4 penalty score, not the 1-1 tie, and we also beat another highly ranked team in the group stage, but that result ended up in the Unranked section due to a slight name change for the opposition, so we didn't gain anything in the app from either result.

That said, while I find the Ranking App rankings a fun distraction, it's just a ballpark guide, particularly for tournaments where inter-league play is in effect. The real value is in looking at the previous results with an educated eye.

A useful tweak might be to also offer a ranking for your team's current league, though I guess we have that already and it's called the League Table, LOL.
If some leagues stay BY and others switch to SY the Ranking App will likely need to break out the different groupings.

ECNL, USYS, SOCAL, etc would be in the SY grouping.

MLSN, GA, etc would be in the BY grouping.

This is probabaly what both leagues want and could be why MLSN and GA haven't announced anything.
 
A while back I posted that Pats Pre ECNL’s mercurial rise through the rankings looked odd. I suspected that Soccer ranking had an algorithm that give more weight to a team with ECNL or MLS in their team name. You chewed me up for it. But looks like this is it.

I remember this incident differently. There was a clear glitch with the rating, you were apoplectic about it, several people (including me) said wait a day or two for it to resolve itself, and that's exactly what happened.

There is never anything to do with any name of the team, bracket they are playing in, or any other factor. I don't know how many times people need to be told - the only thing that matters, is the game scores - and the team entity that they are attached to.

The ranking app doesnt care about the name of the league. With Pats there is a little going on that does give them a boost. In Socal ECNL league theres several highly ranked teams. If they beat or tie these teams it will push them up in rankings assuming they get more than the expected number of goals. However, once Pats are highly ranked themselves the ranking app weights games with similar ranking over those that arent similar. What this means is ties against an equally highly ranked team will help to maintain a high ranking more than they normally would. This is the feedback loop I was talking about. It artificially keeps Socal teams ranked high because of the number of highly ranked teams they play every week in league. Just to be clear I'm not saying that they wouldn't be ranked highly if the ranking app didnt weight playing similar ranked teams. This isnt the case. I am saying that they're ranked a little higher than their ranking would be without weighting similar ranked games.

Well, sure - but again - it's not a bonus or artificial boost for similar-rated teams, that makes the future predictions less accurate. It's set up that way because of the opposite - it instead makes future predictions as accurate as they can be.

I think we need to separate the definition of "better rating" into meaning higher rating / higher quality team, and "better rating" - more impactful on a team's rating. They have completely different meanings, and I think interchanging them may be causing much of the lack of clarity in these discussions.

Take 3 games, for the same team. They play 3 different teams. First one they are 4 goals stronger, and they win 4-0. 2nd one they are very similar, and they win 4-0. 3rd one that are 4 goals weaker, yet they still win 4-0. The first game will have an effective differential of 0 goals. The rating for the team will not be expected to change much at all - both because of the differential - but also because the game won't be weighted heavily. The 2nd one, the effective differential is 4 goals - which will be weighted heavily as the teams ratings are close. The 3rd one, the differential is an astounding 8 goals (congrats to the team), but while it will likely be helpful to the rating, the result will not be weighted as highly to their rating, compared to the 2nd game. Even though it's a fantastic win - it's not as predictive on how future games will go.

If a team consistently plays teams that are similar to themselves, the weightings for each individual game (and their results) will be similar as well. If a team instead plays teams all over the board - it can take a longer time for results of games against dissimilar teams, to make a dent in the rating earned by playing similar teams.

If the ranking app didnt weight games with a similar ranking more than those without a really good team could manipulate the ranking app by blowing out terrible teams 10+-1 over and over.

Sort of. If a team is 5 goals stronger than its opponent, and it keeps shellacking them 10-0 over and over again, they will bend to 10 games apart in strength. It would just take a very significant amount of games to do so. Again - even in this case I still wouldn't define it as manipulation - as what would be happening to the rating is a fair representation of how games are being scored, and the best prediction for how those teams would be expected to perform in the very next game.

Teams in leagues that have a mix of ranked teams in their league get screwed a little in ranking. But this isnt always a bad thing because in playoffs or tournaments they'll be better than their ranking and likely seeded against not as good teams. (initially) Once they blow out a good team the ranking app will quickly adjust their ranking up because they scored several more than the expected number of goals.

The ratings can move quite quickly, yes. If a team isn't getting game results that align with their current rating, one might be surprised how quickly they can either move up or down in rating (and therefore ranking). It doesn't take half a season or more for things to adjust. More recent games are much more heavily weighted than older games, even from only a few weeks back.

Yeah, we just lost a final on PKs to a highly ranked team and the app unfortunately recorded the 5-4 penalty score, not the 1-1 tie, and we also beat another highly ranked team in the group stage, but that result ended up in the Unranked section due to a slight name change for the opposition, so we didn't gain anything in the app from either result.
That's crappy that the tournament entered the score that way. Does the web page of results make it clear that it was a 1-1 game with 5-4 penalty kicks, or does it just show a 5-4 score? If there is a problem there, and you think the app should have scraped it differently - please send a link of the web page results to support. They have to clean up that engine continuously as results pages continue to morph, and often they only are alerted to do so when these anomalies are raised to them. If it is a problem instead with how the tournament entered on their own site and you can get them to correct it, just ping SR and they will rescan it. Some tournaments are more responsive to these changes/corrections, some (maybe most?) couldn't care less and aren't likely to ever correct errors. Some of them care about the electronic results so little that the championship game results never actually get posted to the app after the tournament ends.

For the second case, are you a pro user in the app? If so, and you are sure that the name change for that team (or any team), still represents the same team that is already in the database - add the data source yourself. You can correct anything you see across the board - not just your own team. But be sure, and be careful - as if there are changes made that someone else then objects to and reports - you may need to defend why you made that change, and ultimately lose editing privileges if it happens enough times.

That said, while I find the Ranking App rankings a fun distraction, it's just a ballpark guide, particularly for tournaments where inter-league play is in effect. The real value is in looking at the previous results with an educated eye.

A useful tweak might be to also offer a ranking for your team's current league, though I guess we have that already and it's called the League Table, LOL.
It's not perfect - and all it is able to ever to is to try and predict the future based on what's already happened. It does a darned good job of that, and people tend to underestimate how accurate it turns out to be. Just look at any of the threads here for things like Surf Cup predictions, State Cup predictions, or even just the game history of one's own team.

We create the tables you are suggesting for several team ages in the club, and update them every month or two. It provides a clearer picture than just the current season's standings bracket, about how each team is growing compared to its peers in bracket and beyond. It really helps all parties understand how a team is developing from a performance standpoint.

One way to think of it - is that if a set of teams, across their full history, ever played a single game outside of bracket (no tournaments, no playoffs, no anything), at the end of the season the standings bracket and an SR ratings bracket would look very, very similar. SR is just a representation of that team's performance, just as the standings bracket is. It would be somewhat more predictive than the standings bracket, as that is ranked purely by W/L/D, while SR ratings instead incorporate how significant each win or loss tends to be - but they would be awful close in most scenarios. It's just that in the real world, teams are not limited to a single bracket, and you probably don't want to wipe the slate clean every season to determine how strong an opposing team is - game history across all events and prior seasons can be helpful, and SR gets use out of ones up to 12 mo old, though the weighting of the older ones gets lower and lower.
 
If some leagues stay BY and others switch to SY the Ranking App will likely need to break out the different groupings.

ECNL, USYS, SOCAL, etc would be in the SY grouping.

MLSN, GA, etc would be in the BY grouping.

This is probabaly what both leagues want and could be why MLSN and GA haven't announced anything.

No, and no. There won't be multiple groupings after the transition, and the lack of announcement by those leagues has nothing to do with SR. Team entities will be put in the closest relevant grouping, just like they are now - for groupings that don't match entirely across leagues.
 
For the second case, are you a pro user in the app? If so, and you are sure that the name change for that team (or any team), still represents the same team that is already in the database - add the data source yourself. You can correct anything you see across the board - not just your own team. But be sure, and be careful - as if there are changes made that someone else then objects to and reports - you may need to defend why you made that change, and ultimately lose editing privileges if it happens enough times.

Funnily enough, I tried that and it said the other team was locked for changes. I'm not locked, as I made other changes, so my assumption was this team didn't want that loss on their permanent record or something: this was their first and only tournament under the slightly different name, but definitely the same team.
 
That is curious. I've come across that from time to time, and asked Mark for clarification on the why for each team, whether it was a glitch or was intentionally locked for changes. There are a handful of teams like that in my area (only 2 or 3), that have a spotty/incomplete game history, are locked, and they themselves don't do a good job of adding game results. Part of me wants to think that it's intentional and they are trying to cherrypick results, but the more likely scenario is that the team was being messed with by a third party, and they themselves couldn't be bothered.
 
No, and no. There won't be multiple groupings after the transition, and the lack of announcement by those leagues has nothing to do with SR. Team entities will be put in the closest relevant grouping, just like they are now - for groupings that don't match entirely across leagues.
How would you put both BY and SY groups into a single grouping? Would BY be playing up with SY groupings or would you have SY playing up with BY groupings? By not splitting BY and SY into their own groups you'd be giving a huge advantage to whichever league is the oldest in the grouping.


You really need to think though your statements before declaring them correct.
 
A while back I posted that Pats Pre ECNL’s mercurial rise through the rankings looked odd. I suspected that Soccer ranking had an algorithm that give more weight to a team with ECNL or MLS in their team name. You chewed me up for it. But looks like this is it.

I remember this incident differently. There was a clear glitch with the rating, you were apoplectic about it, several people (including me) said wait a day or two for it to resolve itself, and that's exactly what happened.

There is never anything to do with any name of the team, bracket they are playing in, or any other factor. I don't know how many times people need to be told - the only thing that matters, is the game scores - and the team entity that they are attached to.

I was pretty sure that was the case, but I went back to confirm it. Read it through again yourself if you like.
 
How would you put both BY and SY groups into a single grouping? Would BY be playing up with SY groupings or would you have SY playing up with BY groupings? By not splitting BY and SY into their own groups you'd be giving a huge advantage to whichever league is the oldest in the grouping.


You really need to think though your statements before declaring them correct.

We've already had this debate. Just scroll up. We (or at least I) agreed to see what SR is actually going to implement if and when MLS N stays on current schedule, and the rest shift. One of us will be wrong.
 
Funnily enough, I tried that and it said the other team was locked for changes. I'm not locked, as I made other changes, so my assumption was this team didn't want that loss on their permanent record or something: this was their first and only tournament under the slightly different name, but definitely the same team.
I also wonder if there were too many users making adhoc changes to the team. I've seen a few teams locked and it again was due to teams changing their name but still being the same team. The examples I saw were EA2 teams changing name to EA and EA teams changing name to MLSN2. Over time I guess the results for each will normalize to the teams performance.
 
On the girls side, it doesn't matter. 5-7 months doesn't buy you much. The top ranked 2009 team is only one point higher than the top ranked 2011. That's two whole years of age difference, for one point of expected goal differential. No need to worry yourself sick over a 5 month shift.
 
On the girls side, it doesn't matter. 5-7 months doesn't buy you much. The top ranked 2009 team is only one point higher than the top ranked 2011. That's two whole years of age difference, for one point of expected goal differential. No need to worry yourself sick over a 5 month shift.
I'm not worried about it + I agree with you.

However some parents, club owners, leagues, etc, etc will howl about a 6 month difference, until it gets changed or maybe a handicap was added to the view to normalize. But, the handicap will then be open to interpretation + bickering. In the end the ranking app will split SY and BY groupings appart just to stop the annoyance. Splitting BY + SY appart is just a dashboard view. It doesnt change any of the math going on.

Using 2011 and a BY mentality there would be a 2011 and 2012.5 grouping.

Using 2011 and a SY mentality there would be an 8th grade and 7.5 grade grouping.
 
Back
Top