What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 翁???莊???茸楔翁???莊???茸?^ 10001001101001010011111100111111001111111110010010110101001111110011111100111111100100011111100110011110101101101000100110100101001111110011111100111111111001001011010100111111001111110011111110010001111110010011111101011110 89a53f3f3fe4b53f3f3f91f99eb689a53f3f3fe4b53f3f3f91f93f5e
EUC-JP 翁???莊??焞茸楔翁???莊??焞茸?^ 1011001010100111001111110011111100111111111010001011011100111111001111111000111111001001111010101100001011111011110111001011100010110010101001110011111100111111001111111110100010110111001111110011111110001111110010011110101011000010111110110011111101011110 b2a73f3f3fe8b73f3f8fc9eac2fbdcb8b2a73f3f3fe8b73f3f8fc9eac2fb3f5e
UTF-8 翁골렰렑莊렱뤯焞茸楔翁골렰렑莊렱뤯焞茸卨^ 11100111101111111000000111101010101100111010100011101011101000001011000011101011101000001001000111101000100011101000101011101011101000001011000111101011101001001010111111100111100001001001111011101000100011001011100011100110101001011001010011100111101111111000000111101010101100111010100011101011101000001011000011101011101000001001000111101000100011101000101011101011101000001011000111101011101001001010111111100111100001001001111011101000100011001011100011100101100011011010100001011110 e7bf81eab3a8eba0b0eba091e88e8aeba0b1eba4afe7849ee88cb8e6a594e7bf81eab3a8eba0b0eba091e88e8aeba0b1eba4afe7849ee88cb8e58da85e
UHC 翁골렰렑莊렱뤯焞茸楔翁골렰렑莊렱뤯焞茸卨^ 1110100010111010101100001111000110001110101111011000111010100110111011011111011010001110101111101000111111011101110101001100100011101001110001111110000011011011111010001011101010110000111100011000111010111101100011101010011011101101111101101000111010111110100011111101110111010100110010001110100111000111111000001101100101011110 e8bab0f18ebd8ea6edf68ebe8fddd4c8e9c7e0dbe8bab0f18ebd8ea6edf68ebe8fddd4c8e9c7e0d95e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
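
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The helper name encode_row and the CODECS mapping are hypothetical, and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the table above.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "翁^"  # any single character or short string
    for label, codec in CODECS.items():
        bits, hexa = encode_row(sample, codec)
        print(label, bits, hexa)
```

Because each byte is rendered as exactly 8 binary digits and 2 hexadecimal digits, the binary column is always four times as long as the hexadecimal column.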