To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 弱??厓ц?鈺ц?弱??厓ц?鈺ц?^ 1000111011100011001111110011111111111010100011011000010010001000001111111111101111000100100001001000100000111111100011101110001100111111001111111111101010001101100001001000100000111111111110111100010010000100100010000011111101011110 8ee33f3ffa8d84883ffbc484883f8ee33f3ffa8d84883ffbc484883f5e
EUC-JP 弱??厓ц?鈺ц?弱??厓ц?鈺ц?^ 101111001110010100111111001111111000111110110100110001111010011111101000001111111000111111100011110101011010011111101000001111111011110011100101001111110011111110001111101101001100011110100111111010000011111110001111111000111101010110100111111010000011111101011110 bce53f3f8fb4c7a7e83f8fe3d5a7e83fbce53f3f8fb4c7a7e83f8fe3d5a7e83f5e
UTF-8 弱놅쉽厓ц춶鈺ц븪弱놅쉽厓ц춶鈺ц몚^ 111001011011110010110001111010111000011010000101111011001000100110111101111001011000111010010011110100011000011011101100101101101011011011101001100010001011101011010001100001101110101110111000101010101110010110111100101100011110101110000110100001011110110010001001101111011110010110001110100100111101000110000110111011001011011010110110111010011000100010111010110100011000011011101011101010101001101001011110 e5bcb1eb8685ec89bde58e93d186ecb6b6e988bad186ebb8aae5bcb1eb8685ec89bde58e93d186ecb6b6e988bad186ebaa9a5e
UHC 弱놅쉽厓ц춶鈺ц븪弱놅쉽厓ц춶鈺ц몚^ 11100101101100001000011011101111101111011011000111100100111011011010110011101000101011011001001011101000101011011010110011101000100101011001001111100101101100001000011011101111101111011011000111100100111011011010110011101000101011011001001011101000101011011010110011101000100100011000100001011110 e5b086efbdb1e4edace8ad92e8adace89593e5b086efbdb1e4edace8ad92e8adace891885e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)