To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 弱??榮????????昻??諭 1000111011100011001111110011111110011110110001000011111100111111001111110011111100111111001111110011111100111111111110101101000000111111001111111001011101000000 8ee33f3f9ec43f3f3f3f3f3f3f3ffad03f3f9740
EUC-JP 弱??榮???????????諭 10111100111001010011111100111111110111001100011000111111001111110011111100111111001111110011111100111111001111110011111100111111001111111100110110100001 bce53f3fdcc63f3f3f3f3f3f3f3f3f3f3fcda1
UTF-8 弱뉖젡榮녿젫惡욌젳凉붾졁昻뽰푷諭 111001011011110010110001111010111000100110010110111011001010000010100001111001101010011010101110111010111000010110111111111011001010000010101011111011111010011010111001111011001001101010001100111011001010000010110011111011111010010110111001111010111011011010111110111011001010000110000001111001101001100010111011111010111011110110110000111011011001000110110111111010001010101110101101 e5bcb1eb8996eca0a1e6a6aeeb85bfeca0abefa6b9ec9a8ceca0b3efa5b9ebb6beeca181e698bbebbdb0ed91b7e8abad
UHC 弱뉖젡榮녿젫惡욌젳凉붾졁昻뽰푷諭 1110010110110000100001111110101110100000100110101110011110110100100001101110101110100000101000111110011111110111100111101110101110100000101001111110010110111100100101001110101110100000101100101110010011101001100101101110110010111110100001011110101110110001 e5b087eba09ae7b486eba0a3e7f79eeba0a7e5bc94eba0b2e4e996ecbe85ebb1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)