To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????BF 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100001001000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f4246
SJIS-WIN 鴉??誼??儒??癲??誼?BF 111010011110101100111111001111111000101101100010001111110011111110001110111100100011111100111111111000011001111100111111001111111000101101100010001111110100001001000110 e9eb3f3f8b623f3f8ef23f3fe19f3f3f8b623f4246
EUC-JP 鴉??誼??儒??癲??誼?BF 111100101110110100111111001111111011010111000011001111110011111110111100111101000011111100111111111000101010000100111111001111111011010111000011001111110100001001000110 f2ed3f3fb5c33f3fbcf43f3fe2a13f3fb5c33f4246
UTF-8 鴉띻낮誼쎿듉儒우젲癲싳뼔誼뾬BF 1110100110110100100010011110101110011101101110111110101110000010101011101110100010101010101111001110110010001110101111111110101110010011100010011110010110000100100100101110110010011010101100001110110010100000101100101110011110011001101100101110110010001011101100111110101110111100100101001110100010101010101111001110101110111110101011000100001001000110 e9b489eb9dbbeb82aee8aabcec8ebfeb9389e58492ec9ab0eca0b2e799b2ec8bb3ebbc94e8aabcebbeac4246
UHC 鴉띻낮誼쎿듉儒우젲癲싳뼔誼뾬BF 111001001011110010001101111010101011001110110111111010111111111010011011111001101000101010111100111010101110001110111111111011001010000010100110111011111010011010011010111011001001011010011100111010111111111010010111011011110100001001000110 e4bc8deab3b7ebfe9be68abceae3bfeca0a6efa69aec969cebfe976f4246

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)