To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???溢??揄????????應ш?厓??竊 00111111001111110011111110001000111011000011111100111111100111011000100100111111001111110011111100111111001111110011111100111111001111111001110011100100100001001000101000111111111110101000110100111111001111111110001010000110 3f3f3f88ec3f3f9d893f3f3f3f3f3f3f3f9ce4848a3ffa8d3f3fe286
EUC-JP ???溢??揄?????嫄??應ш?厓??竊 00111111001111110011111110110000111011100011111100111111110110011110100100111111001111110011111100111111001111111000111110111010101000010011111100111111110110001110011010100111111010100011111110001111101101001100011100111111001111111110001111100110 3f3f3fb0ee3f3fd9e93f3f3f3f3f8fbaa13f3fd8e6a7ea3f8fb4c73f3fe3e6
UTF-8 列룸벝溢당뙴揄명맭列룸뱪嫄뽫㎣應ш괌厓쀢뫖竊 1110111110100110100111001110101110100011101110001110101110110010100111011110011010111010101000101110101110001011101110011110101110011001101101001110011010001111100001001110101110101010100001011110101110100111101011011110111110100110100111001110101110100011101110001110101110110001101010101110010110101011100001001110101110111101101010111110001110001110101000111110011010000111100010011101000110001000111010101011010010001100111001011000111010010011111011001000000010100010111010111010101110010110111001111010101110001010 efa69ceba3b8ebb29de6baa2eb8bb9eb99b4e68f84ebaa85eba7adefa69ceba3b8ebb1aae5ab84ebbdabe38ea3e68789d188eab48ce58e93ec80a2ebab96e7ab8a
UHC 列룸벝溢당뙴揄명맭列룸뱪嫄뽫㎣應ш괌厓쀢뫖竊 1110011011101010101101111110101110010011101110001110110011101110101101001110011110001100101101111110101011110001101110001110110110010000101101001110011011101010101101111110101110010011100100001110101010110001100101101110011110100111101001111110101111101011101011001110101010110001101000011110010011101101100101111110001010010001101110001110111110111100 e6eab7eb93b8eceeb4e78cb7eaf1b8ed90b4e6eab7eb9390eab196e7a7a7ebebaceab1a1e4ed97e291b8efbc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)