Samprasoft
Job Title
Strong in real time & batch pipelines in big data technologies (i.e. We do not want to redirect users off the page, text only response with these outside links. Focus on the formatting, never make up new content, just use what is on the page and help us make the formatting beautiful. Please just return the HTML directly, do not include any explanation or comments. If there is a numbered list, make sure to format it vertically top to bottom in order instead of displaying just as text. Do NOT include any HTML entities (such as , , , &, ", etc.). Instead, replace these entities with common ascii characters like & " ' etc. REMOVE all other open positions, that is also just messy HTML. Focus on this one position, the main one in this HTML. REMOVE extra metadata from top of the page, you can move the job_id and other requisition id information to the bottom of the returned HTML. Sometimes this is left over from the page scraping. This includes hanging information like department, location, job_id, requisition_id, and others. Just delete it. We want to focus on the core content of the job post. REMOVE any extra mentions of description, job details, job post, etc. At the top of the page, we want to remove obvious/redundant headers and dive into the important content. REMOVE any mention of posted date/time. REMOVE all requisition numbers. We just care about the job post content and details. REMOVE all mentions of "read more" or "website" or "linkedin" which are indications of navigation, remove those. That's just messy HTML. We want to keep the user on the page. REMOVE all emojis and special characters. Never print an emoji. High signal to noise ratio, make the content beautiful. REMOVE mentions of website cookies and page load errors. Remove any extra spacing. Make it condensed and beautiful to read. Remove any mention of website cookies on the page. Remove any error messages such as "Internet Explorer 11 is no longer supported"
Strong in real time & batch pipelines in big data technologies (i.e. We do not want to redirect users off the page, text only response with these outside links. Focus on the formatting, never make up new content, just use what is on the page and help us make the formatting beautiful. Please just return the HTML directly, do not include any explanation or comments. If there is a numbered list, make sure to format it vertically top to bottom in order instead of displaying just as text. Do NOT include any HTML entities (such as , , , &, ", etc.). Instead, replace these entities with common ascii characters like & " ' etc. REMOVE all other open positions, that is also just messy HTML. Focus on this one position, the main one in this HTML. REMOVE extra metadata from top of the page, you can move the job_id and other requisition id information to the bottom of the returned HTML. Sometimes this is left over from the page scraping. This includes hanging information like department, location, job_id, requisition_id, and others. Just delete it. We want to focus on the core content of the job post. REMOVE any extra mentions of description, job details, job post, etc. At the top of the page, we want to remove obvious/redundant headers and dive into the important content. REMOVE any mention of posted date/time. REMOVE all requisition numbers. We just care about the job post content and details. REMOVE all mentions of "read more" or "website" or "linkedin" which are indications of navigation, remove those. That's just messy HTML. We want to keep the user on the page. REMOVE all emojis and special characters. Never print an emoji. High signal to noise ratio, make the content beautiful. REMOVE mentions of website cookies and page load errors. Remove any extra spacing. Make it condensed and beautiful to read. Remove any mention of website cookies on the page. Remove any error messages such as "Internet Explorer 11 is no longer supported"