To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 鼇??箋∽?鈺ワ?v鼇??箋∽?鈺ワ?vB 11101010100001110011111100111111111000101011001110000001111001000011111111111011110001001000001110001111001111110111011011101010100001110011111100111111111000101011001110000001111001000011111111111011110001001000001110001111001111110111011001000010 ea873f3fe2b381e43ffbc4838f3f76ea873f3fe2b381e43ffbc4838f3f7642
EUC-JP 鼇??箋∽?鈺ワ?v鼇??箋∽?鈺ワ?vB 111100111110011100111111001111111110010010110101101000101110011000111111100011111110001111010101101001011110111100111111011101101111001111100111001111110011111111100100101101011010001011100110001111111000111111100011110101011010010111101111001111110111011001000010 f3e73f3fe4b5a2e63f8fe3d5a5ef3f76f3e73f3fe4b5a2e63f8fe3d5a5ef3f7642
UTF-8 鼇뤄쉭箋∽슈鈺ワ슥v鼇뤄쉭箋∽슈鈺ワ슥vB 111010011011110010000111111010111010010010000100111011001000100110101101111001111010111010001011111000101000100010111101111011001000101010001000111010011000100010111010111000111000001110101111111011001000101010100101011101101110100110111100100001111110101110100100100001001110110010001001101011011110011110101110100010111110001010001000101111011110110010001010100010001110100110001000101110101110001110000011101011111110110010001010101001010111011001000010 e9bc87eba484ec89ade7ae8be288bdec8a88e988bae383afec8aa576e9bc87eba484ec89ade7ae8be288bdec8a88e988bae383afec8aa57642
UHC 鼇뤄쉭箋∽슈鈺ワ슥v鼇뤄쉭箋∽슈鈺ワ슥vB 111010001010100010110111111011111011110110101101111011111010100010100001111011111011110110110100111010001010110110101011111011111011110110111011011101101110100010101000101101111110111110111101101011011110111110101000101000011110111110111101101101001110100010101101101010111110111110111101101110110111011001000010 e8a8b7efbdadefa8a1efbdb4e8adabefbdbb76e8a8b7efbdadefa8a1efbdb4e8adabefbdbb7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)