To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????S??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010100110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f533f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?壙?????????S?壙????????B 00111111100110101101101100111111001111110011111100111111001111110011111100111111001111110011111101010011001111111001101011011011001111110011111100111111001111110011111100111111001111110011111101000010 3f9adb3f3f3f3f3f3f3f3f3f533f9adb3f3f3f3f3f3f3f3f42
EUC-JP ?壙?????????S?壙????????B 00111111110101001101110100111111001111110011111100111111001111110011111100111111001111110011111101010011001111111101010011011101001111110011111100111111001111110011111100111111001111110011111101000010 3fd4dd3f3f3f3f3f3f3f3f3f533fd4dd3f3f3f3f3f3f3f3f42
UTF-8 렻壙렊렻렟렻렔렻렟렻렖S렻壙렊렻파렔렻렧렻뽁B 1110101110100000101110111110010110100011100110011110101110100000100010101110101110100000101110111110101110100000100111111110101110100000101110111110101110100000100101001110101110100000101110111110101110100000100111111110101110100000101110111110101110100000100101100101001111101011101000001011101111100101101000111001100111101011101000001000101011101011101000001011101111101101100011001000110011101011101000001001010011101011101000001011101111101011101000001010011111101011101000001011101111101011101111011000000101000010 eba0bbe5a399eba08aeba0bbeba09feba0bbeba094eba0bbeba09feba0bbeba09653eba0bbe5a399eba08aeba0bbed8c8ceba094eba0bbeba0a7eba0bbebbd8142
UHC 렻壙렊렻렟렻렔렻렟렻렖S렻壙렊렻파렔렻렧렻뽁B 1000111011000011110011101100010110001110101000011000111011000011100011101011000010001110110000111000111010101001100011101100001110001110101100001000111011000011100011101010101101010011100011101100001111001110110001011000111010100001100011101100001111000110110001001000111010101001100011101100001110001110101101101000111011000011101110111100100001000010 8ec3cec58ea18ec38eb08ec38ea98ec38eb08ec38eab538ec3cec58ea18ec3c6c48ea98ec38eb68ec3bbc842

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
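
The rows of the table above can be approximated offline with a few lines of Python. This is only a minimal sketch, not the converter's actual implementation. It assumes that the labels SJIS-WIN and UHC correspond to Python's cp932 and cp949 codecs, and that a character missing from a charset is replaced by "?" (0x3f), as the ISO-8859-1 row suggests. The names CHARSETS, encode_row and encode_table are made up for this illustration.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),   # assumption: SJIS-WIN = CP932
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),        # assumption: UHC = CP949
]


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Characters the charset cannot represent become b"?" (0x3f).
    data = text.encode(codec, errors="replace")
    # Decode again to show what survived the round trip.
    shown = data.decode(codec)
    # Eight bits per byte, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


def encode_table(text):
    """Build one (label, shown, bits, hex) row per charset."""
    return [(label, *encode_row(text, codec)) for label, codec in CHARSETS]


if __name__ == "__main__":
    for label, shown, bits, hexstr in encode_table("S"):
        print(label, shown, bits, hexstr)
```

ASCII characters such as "S" and "B" encode to the same single byte (53 and 42) in all five charsets, which is why they remain readable in every row of the table.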