To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????@???????????@B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100000000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100000001000010 3f3f3f3f3f3f3f3f3f3f3f403f3f3f3f3f3f3f3f3f3f3f4042
SJIS-WIN 閭・髫丈ソョ閼帙Ρ菫ョ@閭・髫丈ソョ閼帙Ρ菫ョ@B 111010001000001110100101111010011001101010001111111001001011111110101110111010001000010010011011111000111000001110101111111001001011111110101110010000001110100010000011101001011110100110011010100011111110010010111111101011101110100010000100100110111110001110000011101011111110010010111111101011100100000001000010 e883a5e99a8fe4bfaee8849be383afe4bfae40e883a5e99a8fe4bfaee8849be383afe4bfae4042
EUC-JP 閭・髫丈ソョ閼帙Ρ菫ョ@閭・髫丈ソョ閼帙Ρ菫ョ@B 1110111111100011100011101010010111110001111110101011111011100110100011101011111110001110101011101110111111100100110101101110010110100110101100011110100011000001100011101010111001000000111011111110001110001110101001011111000111111010101111101110011010001110101111111000111010101110111011111110010011010110111001011010011010110001111010001100000110001110101011100100000001000010 efe38ea5f1fabee68ebf8eaeefe4d6e5a6b1e8c18eae40efe38ea5f1fabee68ebf8eaeefe4d6e5a6b1e8c18eae4042
UTF-8 閭・髫丈ソョ閼帙Ρ菫ョ@閭・髫丈ソョ閼帙Ρ菫ョ@B 11101001100101101010110111101111101111011010010111101001101010111010101111100100101110001000100011101111101111011011111111101111101111011010111011101001100101101011110011100101101110001001100111001110101000011110100010001111101010111110111110111101101011100100000011101001100101101010110111101111101111011010010111101001101010111010101111100100101110001000100011101111101111011011111111101111101111011010111011101001100101101011110011100101101110001001100111001110101000011110100010001111101010111110111110111101101011100100000001000010 e996adefbda5e9ababe4b888efbdbfefbdaee996bce5b899cea1e88fabefbdae40e996adefbda5e9ababe4b888efbdbfefbdaee996bce5b899cea1e88fabefbdae4042
UHC 閭??丈??閼帙Ρ菫?@閭??丈??閼帙Ρ菫?@B 11010101111011110011111100111111111011011101101100111111001111111110010011011001111100101110110110100101110100011101000011001011001111110100000011010101111011110011111100111111111011011101101100111111001111111110010011011001111100101110110110100101110100011101000011001011001111110100000001000010 d5ef3f3feddb3f3fe4d9f2eda5d1d0cb3f40d5ef3f3feddb3f3fe4d9f2eda5d1d0cb3f4042

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)