To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 鰲k?壓??壅?n}鰲k?壓??壅?n{^ 1110100111100000100000101000101100111111100110101101100000111111001111111001101011010111001111110110111001111101111010011110000010000010100010110011111110011010110110000011111100111111100110101101011100111111011011100111101101011110 e9e0828b3f9ad83f3f9ad73f6e7de9e0828b3f9ad83f3f9ad73f6e7b5e
EUC-JP 鰲k?壓??壅?n}鰲k?壓??壅?n{^ 1111001011100010101000111110101100111111110101001101101000111111001111111101010011011001001111110110111001111101111100101110001010100011111010110011111111010100110110100011111100111111110101001101100100111111011011100111101101011110 f2e2a3eb3fd4da3f3fd4d93f6e7df2e2a3eb3fd4da3f3fd4d93f6e7b5e
UTF-8 鰲k젒壓꾩뿁壅쿽n}鰲k젒壓꾩뿁壅쿽n{^ 1110100110110000101100101110111110111101100010111110110010100000100100101110010110100011100100111110101010111110101010011110101110111111100000011110010110100011100001011110110010111111101111010110111001111101111010011011000010110010111011111011110110001011111011001010000010010010111001011010001110010011111010101011111010101001111010111011111110000001111001011010001110000101111011001011111110111101011011100111101101011110 e9b0b2efbd8beca092e5a393eabea9ebbf81e5a385ecbfbd6e7de9b0b2efbd8beca092e5a393eabea9ebbf81e5a385ecbfbd6e7b5e
UHC 鰲k젒壓꾩뿁壅쿽n}鰲k젒壓꾩뿁壅쿽n{^ 11101000101001111010001111101011101000001001000111100100111000101000010011101100100101111000100111101000101101011011001101101111011011100111110111101000101001111010001111101011101000001001000111100100111000101000010011101100100101111000100111101000101101011011001101101111011011100111101101011110 e8a7a3eba091e4e284ec9789e8b5b36f6e7de8a7a3eba091e4e284ec9789e8b5b36f6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)