To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 鼇??蜈??搖??v鼇??蜈??搖??vB 111010101000011100111111001111111110010110000101001111110011111110011101100010100011111100111111011101101110101010000111001111110011111111100101100001010011111100111111100111011000101000111111001111110111011001000010 ea873f3fe5853f3f9d8a3f3f76ea873f3fe5853f3f9d8a3f3f7642
EUC-JP 鼇??蜈??搖??v鼇??蜈??搖??vB 111100111110011100111111001111111110100111100101001111110011111111011001111010100011111100111111011101101111001111100111001111110011111111101001111001010011111100111111110110011110101000111111001111110111011001000010 f3e73f3fe9e53f3fd9ea3f3f76f3e73f3fe9e53f3fd9ea3f3f7642
UTF-8 鼇뤄쉈蜈랃쉐搖좄퓥v鼇뤄쉈蜈랃쉐搖좄퓥vB 111010011011110010000111111010111010010010000100111011001000100110001000111010001001110010001000111010111001111010000011111011001000100110010000111001101001000010010110111011001010001010000100111011011001001110100101011101101110100110111100100001111110101110100100100001001110110010001001100010001110100010011100100010001110101110011110100000111110110010001001100100001110011010010000100101101110110010100010100001001110110110010011101001010111011001000010 e9bc87eba484ec8988e89c88eb9e83ec8990e69096eca284ed93a576e9bc87eba484ec8988e89c88eb9e83ec8990e69096eca284ed93a57642
UHC 鼇뤄쉈蜈랃쉐搖좄퓥v鼇뤄쉈蜈랃쉐搖좄퓥vB 111010001010100010110111111011111011110110100101111010001010010110001101111011111011110110100110111010001111010010100000111010001011111110001110011101101110100010101000101101111110111110111101101001011110100010100101100011011110111110111101101001101110100011110100101000001110100010111111100011100111011001000010 e8a8b7efbda5e8a58defbda6e8f4a0e8bf8e76e8a8b7efbda5e8a58defbda6e8f4a0e8bf8e7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)