To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????意????????柔?????裔 00111111001111110011111100111111001111110011111110001000110100110011111100111111001111110011111100111111001111110011111100111111100011110101111100111111001111110011111100111111001111111110010111100001 3f3f3f3f3f3f88d33f3f3f3f3f3f3f3f8f5f3f3f3f3f3fe5e1
EUC-JP ??????意????????柔??嫄??裔 001111110011111100111111001111110011111100111111101100001101010100111111001111110011111100111111001111110011111100111111001111111011110111000000001111110011111110001111101110101010000100111111001111111110101011100011 3f3f3f3f3f3fb0d53f3f3f3f3f3f3f3fbdc03f3f8fbaa13f3feae3
UTF-8 溜삳젙溜삳뿊意쎄굅烈숅젞溜뺣졎柔묒뵿嫄곁몯裔 111011111010011110001011111011001000001010110011111011001010000010011001111011111010011110001011111011001000001010110011111010111011111110001010111001101000010010001111111011001000111010000100111010101011010110000101111011111010011010011111111011001000100010000101111011001010000010011110111011111010011110001011111010111011101010100011111011001010000110001110111001101001111110010100111010111010110010010010111010111011010110111111111001011010101110000100111010101011001110000001111010111010101010101111111010001010001110010100 efa78bec82b3eca099efa78bec82b3ebbf8ae6848fec8e84eab585efa69fec8885eca09eefa78bebbaa3eca18ee69f94ebac92ebb5bfe5ab84eab381ebaaafe8a394
UHC 溜삳젙溜삳뿊意쎄굅烈숅젞溜뺣졎柔묒뵿嫄곁몯裔 1110101011111110101110111110101110100000100101011110101011111110101110111110101110010111100100011110101111110010101111011110101010110001101100001110011011101111100110011110100110100000100110001110101011111110100101011110101110100000101110111110101011110101100100011110110010010100101111011110101010110001101100001110011110010001100110011110011111100000 eafebbeba095eafebbeb9791ebf2bdeab1b0e6ef99e9a098eafe95eba0bbeaf591ec94bdeab1b0e79199e7e0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)