To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 閻??誼?┥怨??閻??誼?┥怨??^ 111010001000010100111111001111111000101101100010001111111000010010111100100010011000010100111111001111111110100010000101001111110011111110001011011000100011111110000100101111001000100110000101001111110011111101011110 e8853f3f8b623f84bc89853f3fe8853f3f8b623f84bc89853f3f5e
EUC-JP 閻??誼?┥怨??閻??誼?┥怨??^ 111011111110010100111111001111111011010111000011001111111010100010111110101100011110010100111111001111111110111111100101001111110011111110110101110000110011111110101000101111101011000111100101001111110011111101011110 efe53f3fb5c33fa8beb1e53f3fefe53f3fb5c33fa8beb1e53f3f5e
UTF-8 閻띯뫖誼놂┥怨쀫솭閻띯뫖誼놂┥怨쀫솦^ 11101001100101101011101111101011100111011010111111101011101010111001011011101000101010101011110011101011100001101000001011100010100101001010010111100110100000001010100011101100100000001010101111101100100001101010110111101001100101101011101111101011100111011010111111101011101010111001011011101000101010101011110011101011100001101000001011100010100101001010010111100110100000001010100011101100100000001010101111101100100001101010011001011110 e996bbeb9dafebab96e8aabceb8682e294a5e680a8ec80abec86ade996bbeb9dafebab96e8aabceb8682e294a5e680a8ec80abec86a65e
UHC 閻띯뫖誼놂┥怨쀫솭閻띯뫖誼놂┥怨쀫솦^ 11100111101000101000110111100010100100011011100011101011111111101011001111101111101001101011111011101010101100111001011111101011100110011010001111100111101000101000110111100010100100011011100011101011111111101011001111101111101001101011111011101010101100111001011111101011100110011001111101011110 e7a28de291b8ebfeb3efa6beeab397eb99a3e7a28de291b8ebfeb3efa6beeab397eb999f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)