108 Commits

Author SHA1 Message Date
Vijay Joshi
843274cf48 INFRA-3703 : Houston side changes to accommodate QA use case (#452)
* INFRA-3703 : Houston side changes to accommodate QA use case

* INFRA-3703 : UT failure fix

* INFRA-3703 : Minor changes

* INFRA-3703 : Unique constraint

* INFRA-3703 : Edit migration file

* INFRA-3703 : PR review comments and UT's

* INFRA-3703 : Channel name resolution
2024-09-05 19:27:42 +05:30
Vijay Joshi
288a7f8457 INFRA-3664 : Make get users in conversation api more performant, add api to sync incident users, module to track request statuses and remove old add incident-user api (#451)
* INFRA-3664 : Make get users in conversation api more performant

* INFRA-3664 : Add apis for incident user sync and get users in incident performance improvements

* INFRA-3664 : Self review

* INFRA-3664 : Add migration script

* INFRA-3664 : Review comments

* INFRA-3664 : Constant changes

* INFRA-3664 : Add rate limit constants

* INFRA-3664 : Add rate limit constants

* INFRA-3664 : Fix failing tests

* INFRA-3664 : Add UT's
2024-08-29 11:52:56 +05:30
Vijay Joshi
804be01c2f INFRA-3467 : Private Houston Incidents (#445)
* INFRA-3467 : Private Houston Incidents

* INFRA-3627 : Minor self review

* INFRA-3627 : PR Review changes

INFRA-3627 : Minor changes

INFRA-3627 : UT fix

INFRA-3637 : Message changes

INFRA-3627 : Minor changes

INFRA-3627 : Constant fix

INFRA-3627 : Do not post SLA breach in public channels for private incidents
2024-08-08 19:20:04 +05:30
Vijay Joshi
55da2b4791 INFRA-3570 : Do not show the current severity and status in update incident in slack UI (#439)
* INFRA-3570 : Do not show same severity and status in update incident in slack UI

* INFRA-3570 : Cyclic dependency fix

* INFRA-3570 : Minor changes

* INFRA-3570 : Add UT'S

* INFRA-3570 : Major refactor

* INFRA-3570 : Move all incident status repo functions to new service

* INFRA-3570 : Add UT's
2024-07-18 13:17:28 +05:30
Shashank Shekhar
7489f0fd77 INFRA-3544 | Adding Sev-4 in Houston (#438)
* INFRA-3554 | Adding sev-4 severity

* INFRA-3554 | Adding the migration queries
2024-07-15 17:49:40 +05:30
Amit Jambotkar
8915c79e18 INFRA-3437|Amit|Add resolve restriction (#435)
* INFRA-3437|Amit|Add resolve restriction

* INFRA-3437|Amit|Add resolve restriction

* INFRA-3437|Amit|Add resolve restriction

* INFRA-3437|Amit|Add resolve restriction

* INFRA-3437|Amit|Add resolve restriction
2024-07-14 16:13:41 +05:30
Vijay Joshi
293220ded8 INFRA-3565 : Remove retrospective and service owner in incident roles (#434)
* INFRA-3565 : Remove retrospective and service owner in incident roles

* INFRA-3565 : Build failure fix
2024-07-11 14:53:39 +05:30
Amit Jambotkar
3a3a0a7c26 INFRA-3437|Amit|Severity reduction (#427)
* INFRA-3437|Amit|Severity reduction

* INFRA-3437|Amit|Severity reduction

* INFRA-3437|Amit|Severity reduction
2024-06-18 17:54:10 +05:30
Vijay Joshi
3186979c34 INFRA-3362 : Filters based on reporting teams and products (#428)
* INFRA-3362 : Filters based on reporting teams and products

* INFRA-3362 : Add grouping in query

* INFRA-3362 : Fix formatting

* INFRA-3362 : Fix formatting
2024-06-18 15:01:28 +05:30
Vijay Joshi
29f0c7bacc INFRA-3012 : Houston topic changes according to new construct (#415)
* INFRA-3012 : Houston title changes according to new construct

* INFRA-3012 : add title change for resolve and duplicate case

* INFRA-3012 : Failing tests fix

* INFRA-3012 : Added migration script for backfilling
2024-04-26 14:52:12 +05:30
Vijay Joshi
0ed941abc3 INFRA-3151 : Add unarchival listener to add houston bot to incident slack channel (#416)
* INFRA-3151 : Add unarchival listener to add houston bot to incident slack channel

* INFRA-3151 : review comments

* INFRA-3151 : format fix
2024-04-12 18:53:06 +05:30
Vijay Joshi
602db7741c INFRA-2887 : Incident reminder dm cron refactor/re-implementation (#413)
* INFRA-2887 : Incident reminder cron refactor/reimplementation

* INFRA-2887 : remove cron code

* INFRA-2887 : remove comment

* INFRA-2887 : Minor changes

* INFRA-2887 : PR comments

* INFRA-3121 : CLEANUP CRON, remove shedlock table and add UT's

* INFRA-2887 : Add default ack

* INFRA-2887 : Handler changes
2024-04-02 18:49:11 +05:30
Vijay Joshi
8cd72b46a3 INFRA-2887 : Upsert users cron refactor (#412)
* INFRA-2887 : Upsert users cron refactor

* INFRA-2887 : Review changes

* INFRA-2887 : added UT's

* INFRA-2887 : Second round review changes

* INFRA-2887 : Add default ack

* INFRA-2887 : Add default ack

* INFRA-2887 : make flow async
2024-04-02 16:25:34 +05:30
Vijay Joshi
0d613a4bfb INFRA-2887 : SLA breach heads up cron refactor/reimplementation (#411)
* INFRA-2887 : SLA breach heads up cron refactor/reimplementation

* INFRA-2887 : Code review comments
2024-04-01 19:18:38 +05:30
Vijay Joshi
15c25eeae3 INFRA-2887 : auto archive refactor (#410)
* INFRA-2887 : Refactor auto escalate

* INFRA-2887 : Old cron cleanup and waitgroup refactor

* INFRA-2887 : Remove redundant nil check

* INFRA-3101 : Add UT's and retryable logic

* INFRA-2887 : review comments
2024-03-28 15:09:44 +05:30
Vijay Joshi
a8a0d44da9 INFRA-3098 : Auto escalate refactor (#409)
* INFRA-3098 : Auto escalate refactor

* INFRA-3098 : Delete old cron code

* INFRA-2887 : File and api name changes

* INFRA-2887 : Code review comments
2024-03-27 19:06:22 +05:30
Vijay Joshi
164a29cb86 INFRA-2887 : Added api for post team metrics (#407)
* INFRA-2887 : Added api for post team metrics

* INFRA-2887 : Separate services into cron specific purpose

* INFRA-2887 : PR Review changes

* INFRA-2887 : Old cron cleanup

* INFRA-2887 : Add util fn

* INFRA-2887 : Ashvin's review comments

* INFRA-2887 : empty check on team channel
2024-03-25 14:26:44 +05:30
Shashank Shekhar
256fd54d36 Revert "INFRA-2830 | adding version column in incident entity (#403)" (#406)
This reverts commit 3945abcb8f.
2024-03-21 10:38:40 +05:30
Shashank Shekhar
3945abcb8f INFRA-2830 | adding version column in incident entity (#403) 2024-03-20 16:26:56 +05:30
Shashank Shekhar
38d5095fd9 INFRA-2866 | Fix inactive teams (#401)
* INFRA-2866 | adding condition to fetch only active teams

* INFRA-2866 | adding condition to fetch only active teams
2024-03-19 17:49:37 +05:30
Shashank Shekhar
233c632d38 INFRA-2866 | Create and update incident with assigner and responder from slack (#394)
* INFRA-2866 | create incident modal with product

* INFRA-2866 | Update product flow

* INFRA-2866 | Resolving review comments

* INFRA-2866 | Adding default values for product, assigner and responder

* INFRA-2866 | bug fix in getting assigner and responder team

* INFRA-2866 | bug-fix: users in no team are not getting products

* INFRA-2866 | adding log lines

* INFRA-2866 | adding assigner team members into incident

* INFRA-2866 | updated help command response text

* INFRA-2866 | adding assigner team members by severity

* INFRA-2866 | updating product list for users with no product

* INFRA-2866 | assigner teams = (teamsOfUser ++ teamsOfSelectedProducts)

* INFRA-2866 | renamed assigner to reporting team

* INFRA-2866 | query to seed product as others for current open incidents without any product
2024-03-19 16:26:30 +05:30
Vijay Joshi
9e97bc8d98 INFRA-3009 : Exclude monitoring state incidents from auto escalation (#395)
* INFRA-3009 : Exclude monitoring state incidents from auto escalation

* INFRA-3009 : Constant for sla statuses
2024-03-13 16:27:24 +05:30
Vijay Joshi
2eba625b0d INFRA-2888 : Added custom metrics on all major flows (#393)
* INFRA-2888 : Added alerts on all major flows

* INFRA-2888 : Remove unnecessary space

* INFRA-2888 : Metric handler

* INFRA-2888 : Review changes

* INFRA-2888 : Build fix

* INFRA-2888 : Code cleanup

* INFRA-2888 : Review comments round 1

* INFRA-2888 : Err msg changes

* INFRA-2888 : task to job in name
2024-03-12 19:56:52 +05:30
Vijay Joshi
b7f9e6b7e1 Revert "INFRA-2931 : Add null state archival cleanup of channels (#390)" (#392)
This reverts commit 1eea987453.
2024-03-07 11:05:24 +05:30
Vijay Joshi
1eea987453 INFRA-2931 : Add null state archival cleanup of channels (#390) 2024-03-06 15:02:06 +05:30
Vijay Joshi
793c9183ec INFRA-2931 : Auto archival of all severity incidents (#387)
* INFRA-2931 : Basic setup for auto archival

* INFRA-2931 : Cron implemented for auto archival

* INFRA-2931 : query based on config map

* INFRA-2931 : change query to is false

* INFRA-2931 : Minor changes in names and duplicate archival conditions
2024-03-05 19:25:18 +05:30
Shashank Shekhar
d4d7da3328 INFRA-2866 | Create and update incident api changes (#386)
* INFRA-2866 | added APIs to get product for user and to get assigner and responder teams

* INFRA-2866 | added create-incident-v3 API

* INFRA-2866 | migration script to fill team_severity, team_user and team_user_severity tables

* INFRA-2866 | adding team severity users upon team and severity update

* INFRA-2866 | using update team v2 in slack action

* INFRA-2866 | update product flow

* INFRA-2866 | fixed user not invited issue

* INFRA-2866 | updated API paths

* INFRA-2866 | using constant for header fetching

* INFRA-2866 | PR review changes
2024-03-05 15:26:00 +05:30
Vijay Joshi
17acefb2a8 INFRA-2901 : Add severity team function for incident use cases (#383)
* INFRA-2901 : Add severity team function for incident use cases

* INFRA-2901 : Add migration scripts as single and add nullability in bot and channel response

---------

Co-authored-by: Shashank Shekhar <shashank.shekhar@navi.com>
2024-02-28 18:20:10 +05:30
Shashank Shekhar
c349cd65d5 INFRA-2945 | incident_products entity, repo, service and handler (#382)
* INFRA-2945 | incident_products entity, repo, service and handler

* INFRA-2945 | added product details in the incident DTO and removed incident_product_handler
2024-02-28 14:35:22 +05:30
Vijay Joshi
983c743609 INFRA-2873 : Team Management V2 - Update team, remove member, make manager, get team details Phase 2 changes (#381)
* INFRA-2873 : Completed till update team and remove member flow

* INFRA-2873 : Complete make manager, remove member, get teams flow

* INFRA-2873 : Minor auth service and rebase changes

* INFRA-2873 : Minor changes

* INFRA-2873 : Minor changes

* INFRA-2873 : PR Review changes

* INFRA-2873 : Added UTs

* INFRA-2873 : Second round review changes
2024-02-27 17:02:22 +05:30
Shashank Shekhar
9b1ff52994 INFRA-2922 | product-team orchestration service (#379)
* INFRA-2922 | product-team orchestration service

* INFRA-2922 | UT fixes
2024-02-26 13:29:49 +05:30
Vijay Joshi
7c1282711b INFRA-2873 : Severity wise team member list - Team Management Module - Add team (#372)
* INFRA-2873 : Boilerplate setup for team management revamp:

* INFRA-2873 : Complete till add team flow

* INFRA-2873 : Added unit tests and migration scripts

* INFRA-2873 : Code review comments

* INFRA-2873 : Add getter for team severity

* INFRA-2873 : Second round of review
2024-02-23 16:05:20 +05:30
Shashank Shekhar
94cb9758a0 INFRA-2915 | Products teams mapping (#378)
* INFRA-2915 | ProductTeams entity, Model and Repo

* INFRA-2915 | ProductTeams service, handler and tests

* INFRA-2915 | Added composite unique constraint for product and team ids
2024-02-21 16:19:15 +05:30
Shashank Shekhar
aa8161ca8b INFRA-2911 | Product service, repo and handler (#373)
* INFRA-2911 | Product entity, model and repository

* INFRA-2911 | Product service and handler

* INFRA-2911 | Product service test

* INFRA-2911 | Optimisations

* INFRA-2911 | Optimisations
2024-02-19 18:57:10 +05:30
Shashank Shekhar
36d590221c INFRA-2829 | Implemented transaction in create incident flow (#371)
* INFRA-2829 | Implemented transaction in create incident flow

* INFRA-2829 | created util func for rollback

* INFRA-2829 | removed redundant code to create slack channel
2024-02-16 14:47:22 +05:30
Vijay Joshi
976c12bf83 INFRA-2897 : [DT] Add configurable archival interval in archival cron (#369)
Co-authored-by: Shashank Shekhar <shashank.shekhar@navi.com>
2024-02-15 20:08:36 +05:30
Vijay Joshi
358799442e INFRA-2828 : Drop unused tables, update migrations and removed unused code modules (#367) 2024-02-13 15:43:24 +05:30
Vijay Joshi
f8b286adb1 INFRA-2856 : Added alerts for incident creation, resolve and zenduty failures (#366) 2024-02-12 19:11:02 +05:30
Gullipalli Chetan Kumar
a4c648649b TP-54496 | Created justification message prompt for de-escalation (#360)
* TP-54496| created justification message prompt for de-escalation

* TP-54496| added migration file to update new column in log table

* TP-54496| added feature flag

* TP-54496| created util functions and constants

* TP-54496| updated design changes

* TP-54496| made the changes requested in PR comments

* TP-54496| fixed bugs in merge conflicts

* TP-54496| acknowledging to slack beforehand so as not to time out

* TP-54496| modified log entity field justification

---------

Co-authored-by: Shashank Shekhar <shashank.shekhar@navi.com>
2024-02-07 18:11:50 +05:30
Vijay Joshi
ad96361d68 TP-49979 , TP-52174 : API to get resolution tags + resolve incident API + incident resolve entire flow refactor (#347)
* TP-49979 : Added API to get tags for resolving incident

* TP-49979 : Set up basic structure for resolve incident from UI

* TP-49979 : Complete till post rca flow

* TP-49979 : Complete till rca gen flow

* TP-52174 : rebase changes

* TP-52174 : Integrate with slack

* TP-52174 : fix error in flows

* TP-52174 : Segregate interface and impl

* TP-52174 : Fix ut failures

* TP-52174 : Fix resolve tag api error

* TP-52174 : Fix jira link bug

* TP-52174 : Remove nil

* TP-52174 : Rebase changes

* TP-52174 : Jira links table fix

* TP-52174 : Line length fix

* TP-52174 : Makefile changes

* TP-52174 : Basic bug fixes

* TP-52174 : Minor fixes

* TP-52174 : Add UT's for initial flows

* TP-52174 : Added all UT's

* TP-52174 : More PR review changes

* TP-52174 : Add UT's for incident jira and tag service

* TP-52174 : Fix jira link bug and batched create incident tags db call

* TP-52174 : Make auto archival severities configurable

* TP-52174 : Fix jira link in incident table issue
2024-02-01 15:23:15 +05:30
Ajay Devarakonda
0b9fca2b2f TP-55036 | Added alert for Krakatoa workflow failures while fetching grafana images and csv file (#356)
* TP-38709 | Merging the changes to master on the logfix

* TP-55036 | Added alert for krakatoa failures if there are grafana errors or csv errors

* TP-55036 | Fixed unit tests
2024-01-23 19:02:47 +05:30
Ajay Devarakonda
45fce5521f TP-55035 | Added alerts for conference event creation, fetching and deletion failures (#357)
* TP-38709 | Merging the changes to master on the logfix

* TP-55035 | Added alerts for failures while creating, fetching and deleting conference events

* TP-55035 | Fixed conflicts

* TP-55035 | trying gauge metric to reset counter

* TP-55035 | Reverting gauge metrics
2024-01-23 18:51:56 +05:30
Ajay Devarakonda
d8d17ef901 TP-54479 | Fixed backend to accept date and timestamp in get incidents api call (#352)
* TP-38709 | Merging the changes to master on the logfix

* TP-54479 | Fixed get incidents to accept timestamp for time zone based results
2024-01-19 18:39:37 +05:30
Ajay Devarakonda
7b5161803b TP-53168 | Added alert when RCA generation fails while sending request to Gen AI or when we receive failure from Gen AI (#354)
* TP-38709 | Merging the changes to master on the logfix

* TP-53168 | Implemented metrics publishment on failure of RCA generation

* TP-53168 | Added rca generation failure counter to must register alerts

* TP-53168 | fixed log level on rca failure

* TP-53168 | Added incident id as part of log for rca failure
2024-01-19 17:45:53 +05:30
Gullipalli Chetan Kumar
8e7619f972 TP-52454 : Created Zenduty integration (#348)
* TP-52454| created zenduty integration

* TP-52454| added migration script for external team table

* TP-52454| added extra logs

* TP-52454| modified logs

* TP-52454|added extra logs

* TP-52454| changed post url for zenduty

* TP-52454| fixed bugs in zenduty client

* TP-52454| created constants for environment variables

* TP-52454| enabled zenduty if severity is less than or equal to the defined config
2024-01-12 14:24:19 +05:30
Shashank Shekhar
d962754050 TP-52227 | adding custom metric for slack channel creation failures (#338) 2023-12-26 14:34:22 +05:30
Gullipalli Chetan Kumar
aeb572f47e TP-45807 : Sending google transcripts to gen ai (#315)
* created service for sending google transcripts to gen ai

* TP-45807| resolved bugs in drive service tests

* TP-45807| unit tests for getting conversation data function

* creating driveservice in app context and passing to rca service

* modified the unit tests to accommodate driveservicemock

* resolved merge conflicts

* resolved merge conflicts
2023-12-26 14:28:27 +05:30
Gullipalli Chetan Kumar
74c1b88b3d TP-51709 : Enabled Marking an Incident as Duplicate through update Incident API (#336)
* TP-51709| created mark-duplicate-incident-status function

* TP-51709| made the duplicate status code modular
2023-12-22 14:18:43 +05:30
Shashank Shekhar
5758e603e8 Jira link table (#331)
* TP-51013 | incident_jira entity, repo and service

* TP-51013 | get jira status api

* TP-51013 | added db migration file

* TP-51013 | added migration query to migrate existing jira links into new table

* TP-51013 | removing linked_jira_issues column from incident table

* TP-51013 | removing empty jira fields if no response found for a jira key in jira api response

* TP-51013 | handled jira api failure cases, will return empty jira fields

* TP-51013 | removed linked_jira_issues field from incident entity

* TP-51013 | handled jira link addition and removal in slack action

* TP-51013 | resolving PR comments

* TP-51013 | adding jira link max length check
2023-12-21 16:52:35 +05:30
Ajay Devarakonda
a62ecbe0a5 TP-48512 | Implementation of RCA and tag migration (#296)
* TP-38709 | Merging the changes to master on the logfix

* TP-48512 | Added button element for RCA section and implemented fill rca details

* TP-48512 | Small fixes

* TP-48512 | adding unit tests

* TP-48512 | added unit tests

* TP-48512 | updated color code for rca card

* TP-48512 | Removed duplicate interface

* TP-48512 | Added one more unit test

* TP-48512 | added comments for jira link validation and update

* TP-48512 | Merging the changes to master on the logfix

# Conflicts:
#	cmd/app/handler/slack_handler.go

* TP-48512 | Added button element for RCA section and implemented fill rca details

# Conflicts:
#	common/util/common_util.go
#	common/util/constant.go
#	internal/processor/action/incident_resolve_action.go
#	internal/processor/action/incident_update_jira-links_action.go
#	internal/processor/action/incident_update_resolution_text_action.go
#	internal/processor/action/view/incident_resolution_text.go
#	internal/processor/action/view/incident_section.go
#	service/slack/slack_service.go

* TP-48512 | Small fixes

* TP-48512 | adding unit tests

* TP-48512 | added unit tests

# Conflicts:
#	Makefile
#	service/incident/incident_service_v2_interface.go

* TP-48512 | updated color code for rca card

* TP-48512 | Removed duplicate interface

* TP-48512 | Added one more unit test

* TP-48512 | added comments for jira link validation and update

* TP-48512 | Fixed merge conflicts

* TP-48512 | Fixed merge conflicts

* TP-48512 | Fixed merge conflicts

* TP-48512 | Added sql migration script for adding tags

* TP-48512 | Updated sql migration script for adding tags

* TP-48512 | Fixed merge conflicts and updated tags in sql migration script
2023-12-07 14:13:12 +05:30