Hello how do you handle N+1 problem I know about data loader kotlinlang #graphql-kotlin

neetkee

06/25/2020, 4:01 PM

Hello, how do you handle the N+1 problem? I know about data loaders, but they have a significant problem: they can’t be chained (https://github.com/graphql-java/graphql-java/issues/1078). For example, if I have a field that relies on two data loaders, it may not work. @Dariusz Kuc I’ve heard you don’t use them, so how do you handle it? Can you provide some code? I think it might be a useful example.

Lenny

06/25/2020, 4:59 PM

yeah i asked this question a while back: https://kotlinlang.slack.com/archives/CQLNT7B29/p1589478150040100 … i’m currently using dataloader in a limited fashion and would love to see some example code for caching/batching with coroutines

Dariusz Kuc

06/25/2020, 5:56 PM

unfortunately don’t have a good answer to that

Dariusz Kuc

06/25/2020, 5:57 PM

in general it revolves around properly structuring your code to avoid redundant calls

Dariusz Kuc

06/25/2020, 5:57 PM

(was thinking about doing a post that goes into more detail about it)

Dariusz Kuc

06/25/2020, 5:59 PM

for example let's assume you have a simple schema like

```graphql
type Query {
  products: [Product]
}

type Product {
  description: ProductDescription
  price: ProductPrice
  reviews: [ProductReview]
}
```

Dariusz Kuc

06/25/2020, 6:01 PM

so traditionally you would do something like

```kotlin
fun products(): List<Product> {
    // fetch product ids
    // fetch description
    // fetch price
    // fetch reviews
    return products // the fully populated list assembled from the calls above
}
```

or something like that

Dariusz Kuc

06/25/2020, 6:02 PM

you could destructure it so each `Product` would expose those fields as functions

```kotlin
class Product {
  fun description(): ProductDescription { ... }
  fun price(): ProductPrice { ... }
  fun reviews(): List<ProductReview> { ... }
}
```

Dariusz Kuc

06/25/2020, 6:03 PM

but then you end up with 1 call to get the product IDs and then individual calls for each one of the products

Dariusz Kuc

06/25/2020, 6:03 PM

so it becomes problematic if your underlying services don’t support batch apis

Dariusz Kuc

06/25/2020, 6:04 PM

afaik data loader can be used to batch those calls but as you mentioned it might not work in all cases

Dariusz Kuc

06/25/2020, 6:05 PM

there is also another case when multiple fields within a given `Product` are calculated based on some common data -> if you expose those fields as functions (so they are only calculated when requested) how do you share the common data?

Dariusz Kuc

06/25/2020, 6:08 PM

in this case the pattern we saw that was pretty useful was to use deferred variables to invoke the common service, e.g.

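```kotlin
import kotlinx.coroutines.CoroutineScope
import kotlinx.coroutines.async

// assumption: `scope` is whatever CoroutineScope your request runs in
class Product(scope: CoroutineScope) {
  // started once per Product; every field resolver awaits the same result
  private val deferredServiceData = scope.async {
    slowCallGoesHere
  }

  // await() can only be called from a suspend function
  suspend fun description(): ProductDescription {
    val sharedData = deferredServiceData.await()
    // other logic
    ...
  }

  suspend fun price(): ProductPrice {
    val sharedData = deferredServiceData.await() // reuses the same result
    // price logic
    ...
  }

  ...
}
```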
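For readers without coroutines on the classpath, the same "fetch the common data once, reuse it in every field" idea can be sketched with the standard library's `lazy` delegate. This is a swapped-in, blocking variant and not the `async`/`await` pattern from the message above; `SharedData`, the `slowCall` parameter and the simplified return types are hypothetical and exist only for illustration.

```kotlin
// Hypothetical shared payload; stands in for whatever the slow service returns.
data class SharedData(val name: String, val cents: Long)

// slowCall is an assumed stand-in for the expensive service call.
class Product(private val id: Long, private val slowCall: (Long) -> SharedData) {
    // Evaluated at most once, on first access, then cached
    // (lazy is thread-safe by default).
    private val sharedData: SharedData by lazy { slowCall(id) }

    // Both field functions read the same cached value.
    fun description(): String = "Product ${sharedData.name}"

    fun price(): Long = sharedData.cents
}
```

Unlike the deferred version, nothing is fetched until a field is actually requested, and the first caller blocks while the call runs.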

Dariusz Kuc

06/25/2020, 6:09 PM

but yes this can all get somewhat complex pretty fast

Dariusz Kuc

06/25/2020, 6:10 PM

we just started a discussion about how to simplify this (guess good timing?)

neetkee

06/25/2020, 6:27 PM

> so it becomes problematic if your underlying services don’t support batch apis

but what if these underlying services do support a batching API? For example, if the reviews field requires a network call to ReviewService, and this service has a method like this:

```kotlin
fun getReviews(productIds: List<Long>)
```

neetkee

06/25/2020, 6:28 PM

In this case, we still don’t have a solution to this problem, since that would be a request for every product?

neetkee

06/25/2020, 6:30 PM

In case of dataloader, it can track what ids were requested, and batch them together

neetkee

06/25/2020, 6:32 PM

then, we may need these reviews in some other query that already has information about the product, for example

```graphql
analytics {
  reviews {
    star
  }
}
```

Dariusz Kuc

06/25/2020, 6:34 PM

well if underlying service does support batching you would do the first one

```kotlin
fun products(): List<Product> {
    // fetch product ids
    // fetch description
    // fetch price
    // fetch reviews
    return products // the fully populated list assembled from the calls above
}
```

neetkee

06/25/2020, 6:34 PM

if we have a query like this

```graphql
analytics {
  reviews {
    star
  }
}
products {
  reviews {
    star
  }
}
```

that will be handled too, and there will be only 1 network call
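The batching behaviour described in the last few messages can be sketched roughly as follows. This is a minimal, single-threaded illustration written from scratch, not the java-dataloader API (whose `load` returns a `CompletableFuture`); `BatchLoader`, `batchFn`, `load` and `dispatch` are names chosen for this sketch.

```kotlin
// Collects keys from many resolvers and resolves them with one batch call.
class BatchLoader<K, V>(private val batchFn: (List<K>) -> Map<K, V>) {
    // Keys requested since the last dispatch, de-duplicated, in request order.
    private val pending = LinkedHashSet<K>()
    // Results of earlier dispatches, so a key is never fetched twice.
    private val cache = HashMap<K, V>()

    // Registers a key and returns a handle that yields the value after dispatch().
    fun load(key: K): () -> V? {
        if (key !in cache) pending.add(key)
        return { cache[key] }
    }

    // Resolves every pending key with a single call to batchFn.
    fun dispatch() {
        if (pending.isEmpty()) return
        cache.putAll(batchFn(pending.toList()))
        pending.clear()
    }
}
```

In the query above, both the `analytics` and the `products` resolvers would call `load` with their product ids, and one `dispatch` would then turn all of them into a single call to something like `getReviews(productIds)`.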

Dariusz Kuc

06/25/2020, 6:34 PM

yes but again it depends how you structure your graph

Dariusz Kuc

06/25/2020, 6:35 PM

in our use cases `analytics` and `products` would generally reside in separate microservices

neetkee

06/25/2020, 6:40 PM

Do you primarily use functionDataFetcher in your projects? If our `product` needs to access an external API in order to fetch `reviews`, will some `reviewsClient` be injected right into it?

Dariusz Kuc

06/25/2020, 6:42 PM

yes

neetkee

06/25/2020, 6:49 PM

Maybe we need a recommended approach or a generic example of it. When I first started using this library, I couldn’t figure it out. I think it’s an uncommon approach, especially for someone who was using REST. We are used to treating such objects as some kind of DTO.

👍 1

neetkee

06/25/2020, 6:54 PM

> it depends how you structure your graph

So, if my graph may contain different objects that use some shared source of data, is there any way to make fewer external calls? These objects could contain different arguments that were passed by our clients, or resolved during execution. I’m not sure if a deferred variable can help here.

Dariusz Kuc

06/25/2020, 6:55 PM

In your `analytics` and `products` example I don’t think a common deferred would work, as they don’t share a common parent

Dariusz Kuc

06/25/2020, 6:57 PM

there is always an option to inspect the query using execution environment (from top field) and then manually decide what to call - but again it would be manual and also wouldn’t help with 2 top level queries
