To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 櫻?????儀??蘂??異??邑????? 1001111101001110001111110011111100111111001111110011111110001011010101100011111100111111111001010100000100111111001111111000100011011001001111110011111110010111010101110011111100111111001111110011111100111111 9f4e3f3f3f3f3f8b563f3fe5413f3f88d93f3f97573f3f3f3f3f
EUC-JP 櫻?????儀??蘂??異??邑????? 1101110110101111001111110011111100111111001111110011111110110101101101110011111100111111111010011010001000111111001111111011000011011011001111110011111111001101101110000011111100111111001111110011111100111111 ddaf3f3f3f3f3fb5b73f3fe9a23f3fb0db3f3fcdb83f3f3f3f3f
UTF-8 櫻뗣굠杻ⓨ쮦儀륁쭖蘂띠꼪異쇗콨邑㏃뵋略노쑤 111001101010101110111011111010111001011110100011111010101011010110100000111011111010011110001000111000101001001110101000111011001010111010100110111001011000010010000000111010111010010110000001111011001010110110010110111010001001100010000010111010111001110110100000111010101011110010101010111001111001010110110000111011001000011110010111111011001011110110101000111010011000001010010001111000111000111110000011111010111011010110001011111011111010010110110110111010111000010110111000111011001001000110100100 e6abbbeb97a3eab5a0efa788e293a8ecaea6e58480eba581ecad96e89882eb9da0eabcaae795b0ec8797ecbda8e98291e38f83ebb58befa5b6eb85b8ec91a4
UHC 櫻뗣굠杻ⓨ쮦儀륁쭖蘂띠꼪異쇗콨邑㏃뵋略노쑤 111001011010000110001011111000111000001010001000111010101111010010101000111001011010100010000011111010111111000010001111111011001010011110001110111001111101111010110110111011001000010010000111111011001011011010111100111001101011000110011101111010111110100110100111111011001001010010001111111001011011001010110011111010111011111010100101 e5a18be38288eaf4a8e5a883ebf08feca78ee7deb6ec8487ecb6bce6b19debe9a7ec948fe5b2b3ebbea5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)