# 202

Senior Machine Learning Engineer interview questions shared by candidates

## Most frequently asked interview questions

A Senior Machine Learning Engineer was asked... 23 January 2021

2 answers

What was the question?

Thank god I saw this; I have also been told the same.

### I was asked a vague question: it was the input of an ML model, but they wanted me to treat it as a string. Basically, read a string and put it in a hashmap.

2 answers

I asked a few follow-up questions, including what the end-of-line character was, as I did not remember it! Then I solved it with a hashmap, iterating over the string. The question was by no means a good one for evaluating someone's coding skills.
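A minimal sketch of the hashmap approach described above. The original question was vague, so this assumes the task was to count how often each character occurs and that `"\n"` is the end-of-line character to skip; the function name `count_characters` is hypothetical.

```python
def count_characters(text: str) -> dict:
    """Return a hashmap (dict) from each character to its number of occurrences."""
    counts = {}
    for ch in text:
        if ch == "\n":  # assumed end-of-line character; skipped, not counted
            continue
        counts[ch] = counts.get(ch, 0) + 1
    return counts


if __name__ == "__main__":
    print(count_characters("hello\nworld"))
```

A single pass over the string with constant-time dictionary updates keeps the whole thing linear in the length of the input.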

I would learn character-level embeddings and pass them through a shallow neural network (assuming the number of characters in the string is small). The output layer would be a tanh layer, which squashes values into the [-1, 1] range. I would map the output to a fixed-size hash, where the ith bit is 0 if the activation is negative and 1 if it is positive. Try looking into the random projection algorithm used for locality-sensitive hashing for the idea.
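The sign-bit idea can be sketched with random projections alone. This is an illustration, not the answerer's network: the weights are random rather than learned, and `char_vector`, `make_hyperplanes`, and `sign_hash` are hypothetical names. The character-count featurisation stands in for the learned embeddings.

```python
import math
import random


def char_vector(text, dim=16):
    """Hypothetical featurisation: bucket character counts into `dim` slots."""
    vec = [0.0] * dim
    for ch in text:
        vec[ord(ch) % dim] += 1.0
    return vec


def make_hyperplanes(num_bits, dim, seed=0):
    """Draw one random Gaussian weight vector per output bit."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_bits)]


def sign_hash(vector, hyperplanes):
    """Bit i is 1 if the tanh activation of projection i is positive, else 0."""
    bits = 0
    for plane in hyperplanes:
        activation = math.tanh(sum(w * x for w, x in zip(plane, vector)))
        bits = (bits << 1) | (1 if activation > 0 else 0)
    return bits


if __name__ == "__main__":
    planes = make_hyperplanes(num_bits=8, dim=16)
    print(format(sign_hash(char_vector("hello world"), planes), "08b"))
```

Since tanh preserves sign, the bits depend only on which side of each hyperplane the vector falls, which is why similar inputs tend to share many bits.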

### What are Python generators?

2 answers

    USER_ID_LIST = [1, 2, 3, 4, 5, 6, 7, 8, 9]


    def get_user_ids():
        for user_id in USER_ID_LIST:
            yield user_id


    if __name__ == "__main__":
        user_ids = get_user_ids()

        print("First Loop")
        for user_id in user_ids:
            print(user_id)
            if user_id == 5:
                break

        # The generator resumes where the first loop stopped.
        print("Second Loop")
        for user_id in user_ids:
            print(user_id)

Output:

    First Loop
    1
    2
    3
    4
    5
    Second Loop
    6
    7
    8
    9

Generator functions allow you to declare a function that behaves like an iterator. They rely on the yield statement, which works a bit like return because it hands back a value. The difference is that yield saves the state of the function. The next time a value is requested from the generator (for example by a for loop or next()), execution continues from where it left off, with the same variable values it had before yielding.

    USER_ID_LIST = [1, 2, 3, 4, 5, 6, 7, 8, 9]


    def get_user_ids():
        for user_id in USER_ID_LIST:
            yield user_id


    if __name__ == "__main__":
        user_ids = get_user_ids()

        print("First Loop")
        for user_id in user_ids:
            print(user_id)
            if user_id == 5:
                break

        print("Second Loop")
        for user_id in user_ids:
            print(user_id)
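To make the saved state visible, here is a small sketch that drives a generator by hand with `next()`. The `countdown` function is a hypothetical example, not part of the answers above.

```python
def countdown(start):
    """Yield start, start - 1, ..., 1; `current` survives between yields."""
    current = start
    while current > 0:
        yield current
        current -= 1


gen = countdown(3)
first = next(gen)            # runs the body up to the first yield
second = next(gen)           # resumes after that yield, `current` preserved
rest = list(gen)             # drains whatever is left
exhausted = next(gen, None)  # default is returned instead of raising StopIteration

if __name__ == "__main__":
    print(first, second, rest, exhausted)
```

Once a generator is exhausted it stays exhausted, which is why a second loop over the same generator object only sees the remaining items.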

### What role does the test set play and what role does the validation set play when we are building a model?

1 answer

If we are training a model iteratively (e.g. gradient boosting or neural networks), we can evaluate it on a validation set after each iteration and use that score for early stopping or hyperparameter tuning; the weights themselves are updated from the training data only. Once training is finished, we measure how the final model performs on a held-out dataset that played no part in those decisions: the test set. Note that the terms "test" and "validation" are often used interchangeably in practice.
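A minimal sketch of a three-way split, following the common convention that the validation set guides decisions during training and the test set is touched only once at the end. The function name `train_val_test_split` and the 15% default fractions are assumptions for illustration.

```python
import random


def train_val_test_split(items, val_fraction=0.15, test_fraction=0.15, seed=0):
    """Shuffle once, then carve out disjoint test, validation, and training parts."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    n_val = round(len(shuffled) * val_fraction)
    test = shuffled[:n_test]
    val = shuffled[n_test:n_test + n_val]
    train = shuffled[n_test + n_val:]
    return train, val, test


if __name__ == "__main__":
    train, val, test = train_val_test_split(range(100))
    print(len(train), len(val), len(test))
```

Fixing the seed keeps the split reproducible, so the same examples stay held out across experiments.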

### Take home project (which I think for senior+ roles is inappropriate; ask me to share work I've put into production, there's plenty of it).

1 answer

I was asked to spend only 3 hours on it, and even after my best efforts I spent 4 and couldn't meet their expectations. Instead I tried to tackle the problem as I would in production: small incremental value gains. I also challenged the problem itself given the data (again, as I would in a production setting to set expectations about what we can realistically build).

I was told I didn't do enough in the project to solve the problem they asked. Again, that is impossible to do in 3 hours, and secondly, I fundamentally disagreed with the expectation that the problem can be "solved".

Another note: the reviewer didn't look at my project before the interview. I understand things come up, but if I'm expected to take my own time to do the work, the least I can expect is preparation on the other side.

So my two main issues:

1. Asking for proof that a senior candidate knows how to build a classifier on text data (I wouldn't have 8+ years in the field if I couldn't do that).
2. Not engaging in the larger (and frankly more important) conversation that your senior people should be having about how to build reliable and trustworthy models.

### What is Super and how is it used?

1 answer

Super is used to refer to the parent class (the class being inherited from) from within the inheriting class. It is a shortcut that avoids naming the parent class explicitly.
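A short sketch, assuming the question was about Python's built-in `super()`. The `Model` and `Classifier` classes are hypothetical names used only for illustration.

```python
class Model:
    def __init__(self, name):
        self.name = name

    def describe(self):
        return f"model {self.name}"


class Classifier(Model):
    def __init__(self, name, num_classes):
        super().__init__(name)  # run the parent initialiser first
        self.num_classes = num_classes

    def describe(self):
        # Extend the parent's behaviour instead of duplicating it.
        return f"{super().describe()} with {self.num_classes} classes"


if __name__ == "__main__":
    print(Classifier("spam-filter", 2).describe())
```

Calling `super()` rather than `Model` directly means the child keeps working if the parent class is later renamed or the hierarchy changes.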

### What is the difference between Vertical and Horizontal Scaling?

1 answer

Vertical scaling increases the resources of a single computing unit, such as adding RAM and storage to one computer, while horizontal scaling increases the number of computing units.


### What is term frequency in tf-idf?

1 answer

Term frequency is the number of times a term occurs in a document (here, a sentence), normalised by the total number of terms in that document.
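The definition above can be sketched in a few lines. This assumes lowercase whitespace tokenisation, which is a simplification; the function name `term_frequencies` is hypothetical.

```python
from collections import Counter


def term_frequencies(sentence):
    """Map each term to (occurrences of the term) / (total terms in the sentence)."""
    tokens = sentence.lower().split()  # assumed tokenisation: lowercase, whitespace
    counts = Counter(tokens)
    total = len(tokens)
    return {term: count / total for term, count in counts.items()}


if __name__ == "__main__":
    print(term_frequencies("the cat saw the dog"))
```

In "the cat saw the dog", the term "the" occurs 2 times out of 5 tokens, so its term frequency is 0.4.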

### Compute the intersection over union (IoU) of two rectangles. Each rectangle is given by its top-left vertex and its bottom-right vertex.

1 answer

I tried answering via a coordinate geometry approach instead of using max. The interviewer was very kind and helped me towards the solution. I spent too much time and could not finish within the given 40 minutes.
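For reference, a sketch of the max/min approach the interviewer was hinting at. It assumes axis-aligned rectangles in image coordinates (y grows downward), each given as `(left, top, right, bottom)`; the function name is hypothetical.

```python
def intersection_over_union(box_a, box_b):
    """IoU of two axis-aligned boxes given as (left, top, right, bottom)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Overlap along each axis: the smaller far edge minus the larger near edge,
    # clamped at zero when the boxes do not overlap on that axis.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    intersection = inter_w * inter_h

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - intersection

    return intersection / union if union > 0 else 0.0


if __name__ == "__main__":
    print(intersection_over_union((0, 0, 2, 2), (1, 1, 3, 3)))
```

For the two 2x2 boxes in the usage line, the overlap is a 1x1 square and the union is 4 + 4 - 1 = 7, so the IoU is 1/7.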

Showing 1 - 10 of 202 interview questions