To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 迪?????褐嚥疑?迪?????褐嚥疑?^ 1110011110001100001111110011111100111111001111110011111110001010100011001001101010001011100010110101111000111111111001111000110000111111001111110011111100111111001111111000101010001100100110101000101110001011010111100011111101011110 e78c3f3f3f3f3f8a8c9a8b8b5e3fe78c3f3f3f3f3f8a8c9a8b8b5e3f5e
EUC-JP 迪?????褐嚥疑?迪?????褐嚥疑?^ 1110110111101100001111110011111100111111001111110011111110110011111011001101001111101011101101011011111100111111111011011110110000111111001111110011111100111111001111111011001111101100110100111110101110110101101111110011111101011110 edec3f3f3f3f3fb3ecd3ebb5bf3fedec3f3f3f3f3fb3ecd3ebb5bf3f5e
UTF-8 迪쾅벳柳띌렫褐嚥疑뒷迪쾅벳柳띌렫褐嚥疑뒬^ 11101000101111111010101011101100101111101000010111101011101100101011001111101111101001111000100111101011100111011000110011101011101000001010101111101000101001001001000011100101100110101010010111100111100101101001000111101011100100101011011111101000101111111010101011101100101111101000010111101011101100101011001111101111101001111000100111101011100111011000110011101011101000001010101111101000101001001001000011100101100110101010010111100111100101101001000111101011100100101010110001011110 e8bfaaecbe85ebb2b3efa789eb9d8ceba0abe8a490e59aa5e79691eb92b7e8bfaaecbe85ebb2b3efa789eb9d8ceba0abe8a490e59aa5e79691eb92ac5e
UHC 迪쾅벳柳띌렫褐嚥疑뒷迪쾅벳柳띌렫褐嚥疑뒬^ 1110111011101000110001001110011110111010101010101110101011110111101101101110100110001110101110011100101011101000111001101011111111101011111101111011010111011110111011101110100011000100111001111011101010101010111010101111011110110110111010011000111010111001110010101110100011100110101111111110101111110111101101011101110001011110 eee8c4e7baaaeaf7b6e98eb9cae8e6bfebf7b5deeee8c4e7baaaeaf7b6e98eb9cae8e6bfebf7b5dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)