To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 遑晢スュ霆ク訷大ォ厭遑晢スュ霆ク訷大ォ閲^ 111001111010000110011101111011111011110110101101111010001011101110111000111110111010010010010001111001011010101110001001011111011110011110100001100111011110111110111101101011011110100010111011101110001111101110100100100100011110010110101011100010010111101101011110 e7a19defbdade8bbb8fba491e5ab897de7a19defbdade8bbb8fba491e5ab897b5e
EUC-JP 遑晢スュ霆ク訷大ォ厭遑晢スュ霆ク訷大ォ閲^ 11101110101000111101101011110001100011101011110110001110101011011111000010111101100011101011100010001111110111011101010011000010111001111000111010101011101100011101111011101110101000111101101011110001100011101011110110001110101011011111000010111101100011101011100010001111110111011101010011000010111001111000111010101011101100011101110001011110 eea3daf18ebd8eadf0bd8eb88fddd4c2e78eabb1deeea3daf18ebd8eadf0bd8eb88fddd4c2e78eabb1dc5e
UTF-8 遑晢スュ霆ク訷大ォ厭遑晢スュ霆ク訷大ォ閲^ 11101001100000011001000111100110100110011010001011101111101111011011110111101111101111011010110111101001100111001000011011101111101111011011100011101000101010001011011111100101101001001010011111101111101111011010101111100101100011101010110111101001100000011001000111100110100110011010001011101111101111011011110111101111101111011010110111101001100111001000011011101111101111011011100011101000101010001011011111100101101001001010011111101111101111011010101111101001100101101011001001011110 e98191e699a2efbdbdefbdade99c86efbdb8e8a8b7e5a4a7efbdabe58eade98191e699a2efbdbdefbdade99c86efbdb8e8a8b7e5a4a7efbdabe996b25e
UHC 遑???霆??大?厭遑???霆??大??^ 11111100110110100011111100111111001111111110111111111101001111110011111111010011110111100011111111100110111101001111110011011010001111110011111100111111111011111111110100111111001111111101001111011110001111110011111101011110 fcda3f3f3feffd3f3fd3de3fe6f4fcda3f3f3feffd3f3fd3de3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)