To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嶸??飮?????潁??泣?????嶸??泣 11111010101101000011111100111111100111110101101000111111001111110011111100111111001111111001111111110001001111110011111110001011100000110011111100111111001111110011111100111111111110101011010000111111001111111000101110000011 fab43f3f9f5a3f3f3f3f3f9ff13f3f8b833f3f3f3f3ffab43f3f8b83
EUC-JP 嶸??飮??洧??潁??泣?????嶸??泣 1000111110111011111101000011111100111111110111011011101100111111001111111000111111000111101101000011111100111111110111101111001100111111001111111011010111100011001111110011111100111111001111110011111110001111101110111111010000111111001111111011010111100011 8fbbf43f3fddbb3f3f8fc7b43f3fdef33f3fb5e33f3f3f3f3f8fbbf43f3fb5e3
UTF-8 嶸뗭옚飮꿨쮦洧곗뒪潁뺢랬泣ⓩ를琉꾩뒴嶸뗭옚泣 111001011011011010111000111010111001011110101101111011001001100010011010111010011010001110101110111010101011111110101000111011001010111010100110111001101011010010100111111010101011001110010111111010111001001010101010111001101011110110000001111010111011101010100010111010111001111010101100111001101011001110100011111000101001001110101001111010111010010110111100111011111010011110001100111010101011111010101001111010111001001010110100111001011011011010111000111010111001011110101101111011001001100010011010111001101011001110100011 e5b6b8eb97adec989ae9a3aeeabfa8ecaea6e6b4a7eab397eb92aae6bd81ebbaa2eb9eace6b3a3e293a9eba5bcefa78ceabea9eb92b4e5b6b8eb97adec989ae6b3a3
UHC 嶸뗭옚飮꿨쮦洧곗뒪潁뺢랬泣ⓩ를琉꾩뒴嶸뗭옚泣 1110011110101110100010111110110010011110100111101110101111100110101100101110010110101000100000111110101011111011101100001110110010001010101001001110011110111000100101011110101010110111101010001110101111101000101010001110011010111000101001101110101110100100100001001110110010001010101011011110011110101110100010111110110010011110100111101110101111101000 e7ae8bec9e9eebe6b2e5a883eafbb0ec8aa4e7b895eab7a8ebe8a8e6b8a6eba484ec8aade7ae8bec9e9eebe8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)