To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???×^???×K???×^???×K^ 001111110011111100111111110101110101111000111111001111110011111111010111010010110011111100111111001111111101011101011110001111110011111100111111110101110100101101011110 3f3f3fd75e3f3f3fd74b3f3f3fd75e3f3f3fd74b5e
SJIS-WIN 紆?弧×^紆?弧×K紆?弧×^紆?弧×K^ 111000101111110000111111100011001100101010000001011111100101111011100010111111000011111110001100110010101000000101111110010010111110001011111100001111111000110011001010100000010111111001011110111000101111110000111111100011001100101010000001011111100100101101011110 e2fc3f8cca817e5ee2fc3f8cca817e4be2fc3f8cca817e5ee2fc3f8cca817e4b5e
EUC-JP 紆?弧×^紆?弧×K紆?弧×^紆?弧×K^ 111001001111111000111111101110001100110010100001110111110101111011100100111111100011111110111000110011001010000111011111010010111110010011111110001111111011100011001100101000011101111101011110111001001111111000111111101110001100110010100001110111110100101101011110 e4fe3fb8cca1df5ee4fe3fb8cca1df4be4fe3fb8cca1df5ee4fe3fb8cca1df4b5e
UTF-8 紆렡弧×^紆렡弧×K紆렡弧×^紆렡弧×K^ 11100111101101001000011011101011101000001010000111100101101111001010011111000011100101110101111011100111101101001000011011101011101000001010000111100101101111001010011111000011100101110100101111100111101101001000011011101011101000001010000111100101101111001010011111000011100101110101111011100111101101001000011011101011101000001010000111100101101111001010011111000011100101110100101101011110 e7b486eba0a1e5bca7c3975ee7b486eba0a1e5bca7c3974be7b486eba0a1e5bca7c3975ee7b486eba0a1e5bca7c3974b5e
UHC 紆렡弧×^紆렡弧×K紆렡弧×^紆렡弧×K^ 11101001111000011000111010110010111110111100000110100001101111110101111011101001111000011000111010110010111110111100000110100001101111110100101111101001111000011000111010110010111110111100000110100001101111110101111011101001111000011000111010110010111110111100000110100001101111110100101101011110 e9e18eb2fbc1a1bf5ee9e18eb2fbc1a1bf4be9e18eb2fbc1a1bf5ee9e18eb2fbc1a1bf4b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)