Which bit string does a character (or string of characters) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ???????? | 0011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f |
| SJIS-WIN | 擾??陰??吟? | 1000111111101111001111110011111110001001010000010011111100111111100010111110000100111111 | 8fef3f3f89413f3f8be13f |
| EUC-JP | 擾??陰??吟? | 1011111011110001001111110011111110110001101000100011111100111111101101101110001100111111 | bef13f3fb1a23f3fb6e33f |
| UTF-8 | 擾살꽱陰뽰꽱吟킕 | 111001101001001110111110111011001000001010110100111010101011110110110001111010011001100110110000111010111011110110110000111010101011110110110001111001011001000010011111111011011000001010010101 | e693beec82b4eabdb1e999b0ebbdb0eabdb1e5909fed8295 |
| UHC | 擾살꽱陰뽰꽱吟킕 | 11101000111101101011101111101100100001001011110011101011111001001001011011101100100001001011110011101011111000011011010101000001 | e8f6bbec84bcebe496ec84bcebe1b541 |

SJIS-WIN, EUC-JP: legacy Japanese character encodings, used mainly on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
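
A conversion of this kind can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The mapping from the charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, as is the helper name `encode_row`. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes seen in the results above.

```python
# Assumed mapping from the charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for `text` encoded with `codec`."""
    # errors="replace" substitutes b"?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    for name, codec in CODECS.items():
        bits, hex_string = encode_row("擾", codec)
        print(name, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.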