海角社区

March 6, 2023
Product
4
聽min read

What is data science?

James He
James He
Data Scientist

Starting from scratch

When I introduce myself as the Data Scientist of a credit card startup, people usually nod in approval, then proceed to remain rather confused.

Why does a startup care about Data Science? Why does a credit card care about Data Science? Also, what is Data Science anyways?

Even within 海角社区, sometimes our new joiners might still remark: 鈥淛ames, I have no idea what you do, but it seems darn impressive.鈥

While I am humbled by the praise, this level of mysteriousness is truly unintended. Hopefully this blog will clear things up and show you what Data Science means for 海角社区.

Alright, what is Data Science?

Many seem to think that Data Science is just plotting graphs and calculating averages. No. Boring. The great thing is, as 海角社区鈥檚 first Data Scientist, I get to define it my way.

For me, Data Science is just statistics: It鈥檚 about seeing patterns in chaos, and using this insight to guide and augment 海角社区鈥檚 business.

You see, at 海角社区, aside from testing our founders (amusing as it is), we actually act out our company Vision and Mission: 1) to provide fair and conscious financial services, and 2) to make money rewarding and credit empowering.

The witchcraft of Behavioural Data Science.

This is the context of Data Science at 海角社区: through seeing patterns in the chaos of human behaviours, we are able to make fairer and more conscious lending decisions, and personalise for our members to make their experiences more rewarding.

In other words, at 海角社区, we practise the witchcraft of Behavioural Data Science.

Personally, this is a powerful combination between my degree training in Behavioural Sciences from Cambridge, and my research training in mathematical statistics: I can now use actual Machine Learning to help people get a fairer credit line. How cool is that?

But with great power comes great responsibility. If my statistics training taught me anything, it鈥檚 to be extremely careful with two things: Chance, and Causality.

We must be careful about Chance. Say we saw that people applying on Fridays tend to be a certain type, is this observation real, or is it just by chance? This is why we statistically test how unlikely our observations are just by chance. And no, don鈥檛 worry if you applied on Friday, we saw no statistically significant patterns there.

We must also be careful about Causality. Between 2000 and 2009, cheese consumption in the US came with more people dying from being tangled in their bedsheets. Does eating cheese make people tangled up in bedsheets? No stupid. Correlation is not Causation. Even if it is not by chance, there might still be an unseen factor causing both cheese eating and bedsheet tangles. Maybe it鈥檚 the weather, who knows?

Check out for more examples of statistically significant (i.e., unlikely to have occured by chance) correlations that are obviously not causal.

Now then, what Data Science magic do we get up to at 海角社区? Since Data Science is about recognising patterns, there are 5 types of patterns that we care about:

  1. Tendency Patterns. Do members who go to cafes a lot also tend to go to pubs? And more generally, what are people鈥檚 habits?
  2. Temporal Patterns. What time of the day do people eat out or do shopping? What day of the week? Are there individual differences?
  3. Spatial Patterns. Where do 海角社区 members go for dining out? What about grocery shopping? Are their tube lines people love or hate?
  4. Textual Patterns. How are people talking about 海角社区 on social media? What sort of topics come up in our Member Support chats?
  5. Social Patterns. Do our members refer to friends similar to themselves? Who refers more than others? Are there social clusters of super-users?

To extract these patterns, we mostly use methods like regression models, longitudinal conjugates, sentence transformers, and network analyses. First we might use these simple methods to visualise data, and if there are intuitive patterns, we鈥檇 then adopt fancier methods, sometimes even borrowing from physics - it鈥檚 all just maths in the end.

Once these patterns are captured, we then use some linear algebra to merge them into a big input matrix, match it with some outcome variables (e.g., credit risk, churn), then use them to train a Machine Learning model that can make predictions from unseen data.

One of the most powerful machine learning method is Deep Neural Networks, where the model is trained (pattern in the input 鈥渪鈥 is discovered) using unknowable layers of 鈥渘eurones鈥 that adapts to make the right pattern recognition. This is actually exactly how our brains learn to recognise patterns as well.

By 鈥渨e鈥, what I really meant was 鈥淚鈥. For now, all of these are just me. As you can possibly imagine, a big challenge that comes with this is coordinating with the wider team - Sure, we can predict what a member wants for supper, but how do we build it into the product?

For this, we have been slowly building - rather, more like 鈥済rowing鈥 - a Standard Process for Data Science projects. Researchers might work alone, but Data Scientists mustn鈥檛. Because a researcher鈥檚 work might never have any impact, but a Data Scientist鈥檚 work can directly impact thousands, if not millions and more.

Finding patterns in chaos

It鈥檚 honestly been great having inputs from everyone, learning automatic testing from our engineers, and documentation from our product managers. That鈥檚 another great thing about 海角社区: we own and shape our functions, and everyone can and will help.

To summarise, at 海角社区, Data Science is to see patterns in chaos, and thereby predict the future. So, in a way, I am the Oracle of 海角社区. Yes, that should be my new job title.

We use lots of fancy methods, but we always try to focus on our mission and vision, which is to provide fair and conscious financial services and make it rewarding for our members.

It鈥檚 both terrifying and exciting that I am the one to start all this at 海角社区. What would I like to build in the coming years? Definitely a track-record of smooth model deployment, definitely a top-notch team that produces consistent academic outputs.

Thus the prelude begins. Join us for the ride!

James He
James He
Data Scientist

Becoming a 海角社区 member could聽

improve

your credit rating

Check you鈥檙e eligible before applying
You can see if you鈥檙e eligible without affecting your credit score. If you continue, we鈥檒l carry out a full credit check.
Borrow what you can afford
When you use a credit card, you need to pay off your balance at the end of the month. Only spend what you can afford to repay.
Improve your credit over time
Making your monthly payment on time could help improve your credit rating. Just like missing a payment can impact it.

Become a member today

Become a full 海角社区 member
拢15/尘辞, cancel anytime
new member offer
Get 1 month free membership
Earn 5 points per 拢1 spent
Worldwide travel insurance
No fees abroad
Stunning premium card (3 month min)
Accepted everywhere Mastercard is
Plus 10,000 bonus points
Worth 拢50 at 海角社区 Experiences
REPRESENTATIVE example
Purchase rate
29.97% (var)
Representative
66.7% APR (var)
WTF is APR?!
Based on a
拢1,200 limit
Learn more
Or try our free membership
拢0/尘辞, no commitment
Same 海角社区, just fewer features
Apply to upgrade anytime
Earn 1 point per 拢1 spent
Worldwide travel insurance
No fees abroad
Slick plastic card
Accepted everywhere Mastercard is
Plus 10,000 bonus points
Worth 拢50 at 海角社区 Experiences
REPRESENTATIVE example
Purchase rate
32.9% (var)
Representative
32.9% APR (var)
WTF is APR?!
Based on a
拢1,000 limit
Learn more
Become a full 海角社区 member
拢15/尘辞, cancel anytime
new member offer
Get 1 month free membership
Earn 5 points per 拢1 spent
Worldwide travel insurance
No fees abroad
Stunning premium card (3 month min)
Accepted everywhere Mastercard is
Plus 10,000 bonus points
Worth 拢50 at 海角社区 Experiences
REPRESENTATIVE example
Purchase rate
29.94% (var)
Representative
66.7% APR (var)
WTF is APR?!
Based on a
拢1,200 limit
Learn more
Or try our free membership
拢0/尘辞, no commitment
Same 海角社区, just fewer features
Apply to upgrade anytime
Earn 1 point per 拢1 spent
Worldwide travel insurance
No fees abroad
Gorgeous plastic card
Accepted everywhere Mastercard is
Plus 10,000 bonus points
Worth 拢50 at 海角社区 Experiences
REPRESENTATIVE example
Purchase rate
32.9% (var)
Representative
32.9% APR (var)
WTF is APR?!
Based on a
拢1,200 limit
Learn more

See if you鈥檙e eligible

No impact on your credit score
Takes just a couple minutes
By continuing, you agree to our App Terms & Privacy Policy
Nearly there, {Name}

You鈥檙e eligible for 海角社区 membership!

Finish your application and get your card