To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?る???ⅱ鷹??嚴щ???Ⅷ膺??厭?? 0011111110000010111010010011111100111111001111111111101001000001100100011110100100111111001111111001101010001110100001001000101100111111001111110011111110000111010110111110010001011110001111110011111110001001011111010011111100111111 3f82e93f3f3ffa4191e93f3f9a8e848b3f3f3f875be45e3f3f897d3f3f
EUC-JP ?る?堉??鷹??嚴щ????膺??厭?? 0011111110100100111010110011111110001111101101111111110100111111001111111100001011101011001111110011111111010011111011101010011111101011001111110011111100111111001111111110011110111111001111110011111110110001110111100011111100111111 3fa4eb3f8fb7fd3f3fc2eb3f3fd3eea7eb3f3f3f3fe7bf3f3fb1de3f3f
UTF-8 閭る벡堉붹ⅱ鷹숈춶嚴щ베類꾬Ⅷ膺꾪뜐厭묐썑 1110111110100110100001101110001110000010100010111110101110110010101000011110010110100000100010011110101110110110101110011110001010000101101100011110100110110111101110011110110010001000100010001110110010110110101101101110010110011010101101001101000110001001111010111011001010100000111011111010011110010000111010101011111010101100111000101000010110100111111010001000011010111010111010101011111010101010111010111001110010010000111001011000111010101101111010111010110010010000111011001000110110010001 efa686e3828bebb2a1e5a089ebb6b9e285b1e9b7b9ec8888ecb6b6e59ab4d189ebb2a0efa790eabeace285a7e886baeabeaaeb9c90e58eadebac90ec8d91
UHC 閭る벡堉붹ⅱ鷹숈춶嚴щ베類꾬Ⅷ膺꾪뜐厭묐썑 111001101010110110101010111010111011101010100100111010111011110010010100111001101010010110100010111010111110110110011001111011001010110110010010111001011111000110101100111010111011101010100011111010111011101010000100111011111010010110110111111010111110110010000100111011011000110110010011111001101111010010010001111010111001101110000100 e6adaaebbaa4ebbc94e6a5a2ebed99ecad92e5f1acebbaa3ebba84efa5b7ebec84ed8d93e6f491eb9b84

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)