To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????X???? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101100000111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f583f3f3f3f
SJIS-WIN ????????????夷??恁??X???? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110001000110011100011111100111111100111001000110000111111001111110101100000111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f88ce3f3f9c8c3f3f583f3f3f3f
EUC-JP ????????????夷??恁??X???? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110110000110100000011111100111111110101111110110000111111001111110101100000111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3fb0d03f3fd7ec3f3f583f3f3f3f
UTF-8 梨뚰삢吏쒖콠吏뺥삛梨숉삒夷덉콡恁삵샆X梨뚰삢吏 11101111101001111010001011101011100110101011000011101100100000101010001011101111101001111001111011101100100100101001011011101100101111011010000011101111101001111001111011101011101110101010010111101100100000101001101111101111101001111010001011101100100010001000100111101100100000101001001011100101101001001011011111101011100011011000100111101100101111011010000111100110100000011000000111101100100000101011010111101100100000111000011001011000111011111010011110100010111010111001101010110000111011001000001010100010111011111010011110011110 efa7a2eb9ab0ec82a2efa79eec9296ecbda0efa79eebbaa5ec829befa7a2ec8889ec8292e5a4b7eb8d89ecbda1e68181ec82b5ec838658efa7a2eb9ab0ec82a2efa79e
UHC 梨뚰삢吏쒖콠吏뺥삛梨숉삒夷덉콡恁삵샆X梨뚰삢吏 111011001011000110001100111011011001100010100011111011001010011110011100111011001011000110011000111011001010011110010101111011011001100010011110111011001011000110011001111011011001100010010111111011001010100010001000111011001011000110011001111011001111011010111011111011011001100010110111010110001110110010110001100011001110110110011000101000111110110010100111 ecb18ced98a3eca79cecb198eca795ed989eecb199ed9897eca888ecb199ecf6bbed98b758ecb18ced98a3eca7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)