Commit Graph

100 Commits

Author SHA1 Message Date
Vijay Joshi
5332db8ed3 INFRA-3811 : Tag Value Management Module (#458)
* INFRA-3811 : Tag Value Management Module

* INFRA-3811 : PR review comments

* INFRA-3811 : Entity/Association update fix
2024-09-25 15:23:32 +05:30
Vijay Joshi
fd2a977e82 INFRA-3705 : Creation of RCA ticket for sev-0 non-escalated non-CX incidents (#456)
* INFRA-3705 : Creation of RCA ticket for sev-0 non-escalated non-CX incidents

* INFRA-3705 : Self review

* INFRA-3705 : Fix tests:

* INFRA-3705 : UT's and minor constant changes

* INFRA-3705 : Add migration script
2024-09-17 10:01:07 +05:30
Dhruv Joshi
5c46ee2406 INFRA-3635 | Dhruv | Adds auto incident status change to investigating flow (#455)
* INFRA-3635 | Dhruv | Adds auto incident status change to investigating flow

* INFRA-3635 | Dhruv | Reformats to remove too many changes visible

* INFRA-3635 | Dhruv | Adds initial tests

* INFRA-3635 | Dhruv | fix pr comments

* INFRA-3635 | Dhruv | Fix pr comments

* INFRA-3635 | Dhruv | removes unwanted file change

* INFRA-3635 | Dhruv | initial tests

* INFRA-3635 | Dhruv | updates tests
2024-09-12 16:37:27 +05:30
Vijay Joshi
843274cf48 INFRA-3703 : Houston side changes to accomodate QA use case (#452)
* INFRA-3703 : Houston side changes to accomodate QA use case

* INFRA-3703 : UT failure fix

* INFRA-3703 : Minor changes

* INFRA-3703 : Unique constraint

* INFRA-3703 : Edit migration file

* INFRA-3703 : PR review comments and UT's

* INFRA-3703 : Channel name resolution
2024-09-05 19:27:42 +05:30
Vijay Joshi
288a7f8457 INFRA-3664 : Make get users in conversation api more performant, add api to sync incident users, module to track request statuses and remove old add incident-user api (#451)
* INFRA-3664 : Make get users in conversation api more performant

* INFRA-3664 : Add apis for incident user sync and get users in incident performance improvements

* INFRA-3664 : Self review

* INFRA-3664 : Add migration script

* INFRA-3664 : Review comments

* INFRA-3664 : Constant chanes

* INFRA-3664 : Add rate limit constants

* INFRA-3664 : Add rate limit constants

* INFRA-3664 : Fix failing tests

* INFRA-3664 : Add UT's
2024-08-29 11:52:56 +05:30
Vijay Joshi
804be01c2f INFRA-3467 : Private Houston Incidents (#445)
* INFRA-3467 : Private Houston Incidents

* INFRA-3627 : Minor self review

* INFRA-3627 : PR Review changes

INFRA-3627 : Minor changes

INFRA-3627 : UT fix

INFRA-3637 : Message changes

INFRA-3627 : Minor changes

INFRA-3627 : Constant fix

INFRA-3627 : Do not post SLA breach in public channels for private incidents
2024-08-08 19:20:04 +05:30
Vijay Joshi
55da2b4791 INFRA-3570 : Do not show the current severity and status in update incident in slack UI (#439)
* INFRA-3570 : Do not show same severity and status in update incident in slack UI

* INFRA-3570 : Cyclic dependency fix

* INFRA-3570 : Minor changes

* INFRA-3570 : Add UT'S

* INFRA-3570 : Major refactor

* INFRA-3570 : Move all incident status repo functions to new service

* INFRA-3570 : Add UT's
2024-07-18 13:17:28 +05:30
Vijay Joshi
1f1679b272 INFRA-3126 : Cleanup of deprecated API's and dead code (#414)
* INFRA-3126 : Cleanup of deprecated API's and dead code

* INFRA-3126 : More cleanup
2024-04-15 17:28:39 +05:30
Vijay Joshi
0ed941abc3 INFRA-3151 : Add unarchival listener to add houston bot to incident slack channel (#416)
* INFRA-3151 : Add unarchival listener to add houston bot to incident slack channel

* INFRA-3151 : review comments

* INFRA-3151 : format fix
2024-04-12 18:53:06 +05:30
Vijay Joshi
602db7741c INFRA-2887 : Incident reminder dm cron refactor/re-implementation (#413)
* INFRA-2887 : Incident reminder cron refactor/reimplementation

* INFRA-2887 : remove cron code

* INFRA-2887 : remove comment

* INFRA-2887 : Minor changes

* INFRA-2887 : PR comments

* INFRA-3121 : CLEANUP CRON, remove shedlock table and add UT's

* INFRA-2887 : Add default ack

* INFRA-2887 : Handler changes
2024-04-02 18:49:11 +05:30
Vijay Joshi
8cd72b46a3 INFRA-2887 : Upsert users cron refactor (#412)
* INFRA-2887 : Upsert users cron refactor

* INFRA-2887 : Review changes

* INFRA-2887 : added UT's

* INFRA-2887 : Second round review changes

* INFRA-2887 : Add default ack

* INFRA-2887 : Add default ack

* INFRA-2887 : make flow async
2024-04-02 16:25:34 +05:30
Vijay Joshi
0d613a4bfb INFRA-2887 : SLA breach heads up cron refactor/reimplementation (#411)
* INFRA-2887 : SLA breach heads up cron refactor/reimplementation

* INFRA-2887 : Code review comments
2024-04-01 19:18:38 +05:30
Vijay Joshi
15c25eeae3 INFRA-2887 : auto archive refactor (#410)
* INFRA-2887 : Refactor auto escalate

* INFRA-2887 : Old cron cleanup and waitgroup refactor

* INFRA-2887 : Remove redundant nil check

* INFRA-3101 : aDD UT'S AND RETRYABLE LOGIC

* INFRA-2887 : review comments
2024-03-28 15:09:44 +05:30
Vijay Joshi
a8a0d44da9 INFRA-3098 : Auto escalate refactor (#409)
* INFRA-3098 : Auto escalate refactor

* INFRA-3098 : Delete old cron code

* INFRA-2887 : File and api name changes

* INFRA-2887 : Code review comments
2024-03-27 19:06:22 +05:30
Vijay Joshi
164a29cb86 INFRA-2887 : Added api for post team metrics (#407)
* INFRA-2887 : Added api for post team metrics

* INFRA-2887 : Separate services into cron specific purpose

* INFRA-2887 : PR Review changes

* INFRA-2887 : Old cron cleanup

* INFRA-2887 : Add util fn

* INFRA-2887 : Ashvin's review comments

* INFRA-2887 : empty check on team channel
2024-03-25 14:26:44 +05:30
Shashank Shekhar
b88006f469 INFRA-2866 | removing houston user check from get user products API (#408) 2024-03-22 17:17:42 +05:30
Shashank Shekhar
233c632d38 INFRA-2866 | Create and update incident with assigner and responder from slack (#394)
* INFRA-2866 | create incident modal with product

* INFRA-2866 | Update product flow

* INFRA-2866 | Resolving review comments

* INFRA-2866 | Adding default values for product, assigner and responder

* INFRA-2866 | bug fix in getting assigner and responder team

* INFRA-2866 | bug-fix: users in no team are not getting products

* INFRA-2866 | adding log lines

* INFRA-2866 | adding assigner team members into incident

* INFRA-2866 | updated help command response text

* INFRA-2866 | adding assigner team members by severity

* INFRA-2866 | updating product list for users with no product

* INFRA-2866 | assigner teams = (teamsOfUser ++ teamsOfSelectedProducts)

* INFRA-2866 | renamed assigner to reporting team

* INFRA-2866 | query to seed product as others for current open incidents without any product
2024-03-19 16:26:30 +05:30
Vijay Joshi
2eba625b0d INFRA-2888 : Added custom metrics on all major flows (#393)
* INFRA-2888 : Added alerts on all major flows

* INFRA-2888 : Remove unnecessary space

* INFRA-2888 : Metric handler

* INFRA-2888 : Review changes

* INFRA-2888 : Build fix

* INFRA-2888 : Code cleanup

* INFRA-2888 : Review comments round 1

* INFRA-2888 : Err msg changes

* INFRA-2888 : task to job in name
2024-03-12 19:56:52 +05:30
Vijay Joshi
b7f9e6b7e1 Revert "INFRA-2931 : Add null state archival cleanup of channels (#390)" (#392)
This reverts commit 1eea987453.
2024-03-07 11:05:24 +05:30
Vijay Joshi
1eea987453 INFRA-2931 : Add null state archival cleanup of channels (#390) 2024-03-06 15:02:06 +05:30
Shashank Shekhar
d4d7da3328 INFRA-2866 | Create and update incident api changes (#386)
* INFRA-2866 | added APIs to get product for user and to get asigner and responder teams

* INFRA-2866 | added create-incident-v3 API

* INFRA-2866 | migration script to fill team_severity, team_user and team_user_severity tables

* INFRA-2866 | adding team severity users upon team and severity update

* INFRA-2866 | using update team v2 in slack action

* INFRA-2866 | update product flow

* INFRA-2866 | fixed user not invited issue

* INFRA-2866 | updated API paths

* INFRA-2866 | using constant for header fetching

* INFRA-2866 | PR review changes
2024-03-05 15:26:00 +05:30
Vijay Joshi
8ad6a842f8 INFRA-2971 : Add member api segregated (#385)
* INFRA-2971 : Add member api segregated

* INFRA-2971 : pr review changes

* INFRA-2971 : Message updated
2024-03-05 13:03:56 +05:30
Vijay Joshi
983c743609 INFRA-2873 : Team Management V2 - Update team, remove member, make manager, get team details Phase 2 changes (#381)
* INFRA-2873 : Completed till update team and remove member flow

* INFRA-2873 : Complete make manager, remove member, get teams flow

* INFRA-2873 : Minor auth service and rebase changes

* INFRA-2873 : Minor changes

* INFRA-2873 : Minor changes

* INFRA-2873 : PR Review changes

* INFRA-2873 : Added UTs

* INFRA-2873 : Second round review changes
2024-02-27 17:02:22 +05:30
Vijay Joshi
7c1282711b INFRA-2873 : Severity wise team member list - Team Management Module - Add team (#372)
* INFRA-2873 : Boilerplate setup for team management revamp:

* INFRA-2873 : Complete till add team flow

* INFRA-2873 : Added unit tests and migration scripts

* INFRA-2873 : Code review comments

* INFRA-2873 : Add getter for team severity

* INFRA-2873 : Second round of review
2024-02-23 16:05:20 +05:30
Shashank Shekhar
94cb9758a0 INFRA-2915 | Products teams mapping (#378)
* INFRA-2915 | ProductTeams entity, Model and Repo

* INFRA-2915 | ProductTeams service, handler and tests

* INFRA-2915 | Added composit unique constrain for product and team ids
2024-02-21 16:19:15 +05:30
Shashank Shekhar
c147de3a0c INFRA-2911 | updated error messages for product APIs (#376) 2024-02-20 13:08:20 +05:30
Shashank Shekhar
aa8161ca8b INFRA-2911 | Product service, repo and handler (#373)
* INFRA-2911 | Product entity, modal and repository

* INFRA-2911 | Product service and handler

* INFRA-2911 | Product service test

* INFRA-2911 | Optimisations

* INFRA-2911 | Optimisations
2024-02-19 18:57:10 +05:30
Vijay Joshi
358799442e INFRA-2828 : Drop unused tables, update migrations and removed unused code modules (#367) 2024-02-13 15:43:24 +05:30
Vijay Joshi
f8b286adb1 INFRA-2856 : Added alerts for incident creation, resolve and zenduty failures (#366) 2024-02-12 19:11:02 +05:30
Shashank Shekhar
e88123ede6 INFRA-2847 | Added API to remove member from severity (#365)
* INFRA-2847 | Added API to remove member from severity

* INFRA-2847 | Adding missing return statements after http responses

* INFRA-2847 | Checking if user is already part of sev list or not before removing

* INFRA-2847 | Updated error messages and code formatting
2024-02-12 18:16:12 +05:30
Vijay Joshi
ad96361d68 TP-49979 , TP-52174 : API to get resolution tags + resolve incident API + incident resolve entire flow refactor (#347)
* TP-49979 : Added API to get tags for resolving incident

* TP-49979 : Set up basic structure for resolve incident from UI

* TP-49979 : Complete till post rca flow

* TP-49979 : Complete till rca gen flow

* TP-52174 : rebase changes

* TP-52174 : Integrate with slack

* TP-52174 : fix error in flows

* TP-52174 : Segregate interface and impl

* TP-52174 : Fix ut failures

* TP-52174 : Fix resolve tag api error

* TP-52174 : Fix jira link bug

* TP-52174 : Remove nil

* TP-52174 : Rebase changes

* TP-52174 : Jira links table fix

* TP-52174 : Line length fix

* TP-52174 : Makefile changes

* TP-52174 : Basic bug fixes

* TP-52174 : Minor fixes

* TP-52174 : Add UT's for initial flows

* TP-52174 : Added all UT's

* TP-52174 : More PR review changes

* TP-52174 : Add UT's for incident jira and tag service

* TP-52174 : Fix jira link bug and batched create incident tags db call

* TP-52174 : Make auto archival severities configurable

* TP-52174 : Fix jira link in incident table issue
2024-02-01 15:23:15 +05:30
Ajay Devarakonda
0b9fca2b2f TP-55036 | Added alert for Krakatoa workflow failures while fetching grafana images and csv file (#356)
* TP-38709 | Merging the changes to master on the logfix

* TP-55036 | Added alert for krakatoa failures if there are grafana errors or csv errors

* TP-55036 | Fixed unit tests
2024-01-23 19:02:47 +05:30
Ajay Devarakonda
45fce5521f TP-55035 | Added alerts for conference event creation, fetching and deletion failures (#357)
* TP-38709 | Merging the changes to master on the logfix

* TP-55035 | Added alerts for failures while creating, fetching and deleting conference events

* TP-55035 | Fixed conflicts

* TP-55035 | trying guage metric to reset counter

* TP-55035 | Revertingn guage metrics
2024-01-23 18:51:56 +05:30
Ajay Devarakonda
7b5161803b TP-53168 | Added alert when RCA generation fails while sending request to Gen AI or when we receive failure from Gen AI (#354)
* TP-38709 | Merging the changes to master on the logfix

* TP-53168 | Implemented metrics publishment on failure of RCA generation

* TP-53168 | Added rca generation failure counter to must register alerts

* TP-53168 | fixed log level on rca failure

* TP-53168 | Added incident id as part of log for rca failure
2024-01-19 17:45:53 +05:30
Gullipalli Chetan Kumar
8e7619f972 TP-52454 : Created Zenduty integration (#348)
* TP-52454| created zenduty integration

* TP-52454| added migration script for external team table

* TP-52454| added extra logs

* TP-52454| modified logs

* TP-52454|added extra logs

* TP-52454| changed post url for zenduty

* TP-52454| fixed bugs in zenduty client

* TP-52454| created constants for environmental varibales

* TP-52454| enabled zenduty if severity is less than or equal to the defined config
2024-01-12 14:24:19 +05:30
Gullipalli Chetan Kumar
e4c4fec988 TP-50862 | Exposed an api to get slack conversation pre signed url (#349)
* TP-50862| created api to get slack conversation pre signed url

* TP-50862| added unit tests

* TP-50862| fixed authentication bug
2024-01-09 10:01:33 +05:30
Shashank Shekhar
777437652e TP-52227 | Deleting deprecatd API (#339) 2023-12-27 10:45:11 +05:30
Shashank Shekhar
d962754050 TP-52227 | adding custom metric for slack channel creation failures (#338) 2023-12-26 14:34:22 +05:30
Gullipalli Chetan Kumar
aeb572f47e TP-45807 : Sending google transcripts to gen ai (#315)
* created service for sending google transcripts to gen ai

* TP-45807| resolved bugs in drive service tests

* TP-45807| unit tests for getting conversation data function

* creating driveservice in app context and passing to rca service

* modified the unit tests to accomodate driveservicemock

* resolved merge conflicts

* resolved merge conflicts
2023-12-26 14:28:27 +05:30
Shashank Shekhar
5758e603e8 Jira link table (#331)
* TP-51013 | incident_jira entity, repo and service

* TP-51013 | get jira status api

* TP-51013 | added db migration file

* TP-51013 | added migration query to migrate existing jira links into new table

* TP-51013 | removing linked_jira_issues column from incident table

* TP-51013 | removing empty jira fields if no response found for a jira key in jira api response

* TP-51013 | handled jira api failure cases, will return empty jira fields

* TP-51013 | removed linked_jira_issues field from incident entity

* TP-51013 | handled jira link addition and removal in slack action

* TP-51013 | resolving PR comments

* TP-51013 | adding jira link max length check
2023-12-21 16:52:35 +05:30
Gullipalli Chetan Kumar
f31c75a1fb TP-51771 : Created Maverick Client (#333)
* TP-51771| created maverick client

* TP-51771| added check for empty data response form maverick

* made maverick client generic
2023-12-20 15:12:39 +05:30
Ajay Devarakonda
a62ecbe0a5 TP-48512 | Implementation of RCA and tag migration (#296)
* TP-38709 | Merging the changes to master on the logfix

* TP-48512 | Added button element for RCA section and implemented fill rca details

* TP-48512 | Small fixes

* TP-48512 | adding unit tests

* TP-48512 | added unit tests

* TP-48512 | updated color code for rca card

* TP-48512	| Removed duplicate interface

* TP-48512	| Added one more unit test

* TP-48512 | added comments for jira link validation and update

* TP-48512 | Merging the changes to master on the logfix

# Conflicts:
#	cmd/app/handler/slack_handler.go

* TP-48512 | Added button element for RCA section and implemented fill rca details

# Conflicts:
#	common/util/common_util.go
#	common/util/constant.go
#	internal/processor/action/incident_resolve_action.go
#	internal/processor/action/incident_update_jira-links_action.go
#	internal/processor/action/incident_update_resolution_text_action.go
#	internal/processor/action/view/incident_resolution_text.go
#	internal/processor/action/view/incident_section.go
#	service/slack/slack_service.go

* TP-48512 | Small fixes

* TP-48512 | adding unit tests

* TP-48512 | added unit tests

# Conflicts:
#	Makefile
#	service/incident/incident_service_v2_interface.go

* TP-48512 | updated color code for rca card

* TP-48512	| Removed duplicate interface

* TP-48512	| Added one more unit test

* TP-48512 | added comments for jira link validation and update

* TP-48512 | Fixed merge conflicts

* TP-48512 | Fixed merge conflicts

* TP-48512 | Fixed merge conflicts

* TP-48512 | Added sql migration script for adding tags

* TP-48512 | Updated sql migration script for adding tags

* TP-48512 | Fixed merge conflicts and updated tags in sql migration script
2023-12-07 14:13:12 +05:30
Gullipalli Chetan Kumar
c393b81bbc TP-47335 : Update get teams api to reduce latency by getting user data from database instead of slack (#284)
* TP-47335| created teamservice version 2 for get teams api

* TP-47335| modified the getusers info function to handle nil error

* refactored the structure of team service and created interfaces

* TP-47335| created unit tests

* TP-47335| added unit tests for get teams api

* resolved PR comments

* created custom error types

* made some changes in unit tests

* added unit tests for team handler

* solved merge conflicts

* solved invalid users bug

* resolved merge conflicts

* restricting incident title length to 100 characters

* removed unecessary comments
2023-12-04 15:16:21 +05:30
Shashank Shekhar
428900b58b TP-48564 | Added Jira client with search JQL api (#303) 2023-12-04 12:12:52 +05:30
Vijay Joshi
527ba2c04f TP-44155, TP-47355 : Update incident web refractor + Update severity slack refractor with unit tests (#262)
* TP-44155 : Update incident web refractor

* Resolution of v1 and v2 service calls

* PR review changes

* Rebase fixes

* TP-47355 : Add slack update severity refractor

* Cors fix

* Rebase fix

* Second PR revice changes

* More review changes

* Add concurrency to slack calls

* rebase

* Setup interfaces

* Added unit tests for update incident refractor

* Add more test cases

* Rebase changes

* Fix responder addeb by

* Fix build error

* Fix concurretn slack calls

* Revert rebase bug

* Shorten function length: added slack workflows

* Made fucntion size smaller
2023-11-30 14:24:29 +05:30
Shashank Shekhar
88459577f4 TP-49403 | parameterized slash command (#297)
* TP-49403 | parameterized slash command

* TP-49403 | handeling resolve and rca params also implemented Help-Commands button

* TP-49403 | using command pattern for command resolutiuon and execution

* TP-49403 | made find team by name and find severity by name queries case insensitive

* TP-49403 | updating help message keys
2023-11-30 11:56:32 +05:30
Gullipalli Chetan Kumar
2dd4d710e5 TP-47360 : Created service for uploading slack conversations to s3 and send request to maverick for generating RCA (#290)
* TP-47360| created services to upload slack conversation to cloud and send urls to generate RCA

* created rca input repository

* TP-47360| enabled generating rca service on resolution

* resolved merge conflicts

* TP-47360| added migration script for creating rca input table

* changed json response structure according to contract

* added unit tests

* removed api to make gen ai call

* made changes in message format posted in slack

* changed entity struct and adding flag to enable rca generation

* attaching title, description and replies related to incident status block

* made design changes in message format
2023-11-24 14:39:34 +05:30
Vijay Joshi
928a768b82 TP-48197 : Implementation of RCA Service, repo, handler, validator and webhook for posting RCA to incident channel along with unit tests (#282)
* TP-48197 : Implementation of RCA Service, repo, handler, validator and webhook for posting RCA to incident channel

* Added unit tests

* Added migration script"

* PR Review Changes

* Update migratrion
2023-11-16 12:03:34 +05:30
Shashank Shekhar
0f8c326fe7 TP-47297 | Slack DM for incident reminder (#276)
* TP-47297 | Slack DM for incident reminder - A cron job to send list of open incidents to every users who are part of it along with their role in it

* Delete common/util/config_util.go

---------

Co-authored-by: Md Anees <md.anees@navi.com>
2023-11-09 17:38:43 +05:30
Shashank Shekhar
f075b0df8f TP-48200 | updated response messages in link and unlink jira apis (#278) 2023-11-08 17:29:25 +05:30