To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 嚥???⑤????n}嚥???⑤????n{^ 100110101000101100111111001111110011111110000111010001000011111100111111001111110011111101101110011111011001101010001011001111110011111100111111100001110100010000111111001111110011111100111111011011100111101101011110 9a8b3f3f3f87443f3f3f3f6e7d9a8b3f3f3f87443f3f3f3f6e7b5e
EUC-JP 嚥??獒?????n}嚥??獒?????n{^ 1101001111101011001111110011111110001111110010111011101100111111001111110011111100111111001111110110111001111101110100111110101100111111001111111000111111001011101110110011111100111111001111110011111100111111011011100111101101011110 d3eb3f3f8fcbbb3f3f3f3f3f6e7dd3eb3f3f8fcbbb3f3f3f3f3f6e7b5e
UTF-8 嚥좎뼏獒⑤쨽麗볧넍n}嚥좎뼏獒⑤쨽麗볧넍n{^ 1110010110011010101001011110110010100010100011101110101110111100100011111110011110001101100100101110001010010001101001001110110010101000101111011110111110100110100010001110101110110011101001111110101110000100100011010110111001111101111001011001101010100101111011001010001010001110111010111011110010001111111001111000110110010010111000101001000110100100111011001010100010111101111011111010011010001000111010111011001110100111111010111000010010001101011011100111101101011110 e59aa5eca28eebbc8fe78d92e291a4eca8bdefa688ebb3a7eb848d6e7de59aa5eca28eebbc8fe78d92e291a4eca8bdefa688ebb3a7eb848d6e7b5e
UHC 嚥좎뼏獒⑤쨽麗볧넍n}嚥좎뼏獒⑤쨽麗볧넍n{^ 1110011010111111101000001110110010010110100101111110100010100011101010001110101110100100100101111110011010110000100100111110110110000110100110010110111001111101111001101011111110100000111011001001011010010111111010001010001110101000111010111010010010010111111001101011000010010011111011011000011010011001011011100111101101011110 e6bfa0ec9697e8a3a8eba497e6b093ed86996e7de6bfa0ec9697e8a3a8eba497e6b093ed86996e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
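
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the converter's actual implementation: the mapping from the table's charset labels to Python codec names (for example SJIS-Win to `cp932` and UHC to `cp949`) is an assumption, and `encode_all` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {label: (binary_string, hex_string)} for each charset."""
    result = {}
    for label, codec in CHARSETS.items():
        # Unencodable characters become "?" (0x3f) instead of raising.
        raw = text.encode(codec, errors="replace")
        # Render every byte as eight binary digits, concatenated.
        bits = "".join(f"{byte:08b}" for byte in raw)
        result[label] = (bits, raw.hex())
    return result


if __name__ == "__main__":
    for label, (bits, hex_digits) in encode_all("n{^").items():
        print(label, bits, hex_digits)
```

For the ASCII-only input `n{^`, every charset listed here yields the same bytes, `6e7b5e`, which matches the tail of each row in the table.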