To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鴦??????異??節??沃???嚥▲?竊 11101001111100010011111100111111001111110011111100111111001111111000100011011001001111110011111110010000110111110011111100111111100101111000000000111111001111110011111110011010100010111000000110100011001111111110001010000110 e9f13f3f3f3f3f3f88d93f3f90df3f3f97803f3f3f9a8b81a33fe286
EUC-JP 鴦???孼??異??節??沃???嚥▲?竊 111100101111001100111111001111110011111110001111101110101100001100111111001111111011000011011011001111110011111111000000111000010011111100111111110011011110000000111111001111110011111111010011111010111010001010100101001111111110001111100110 f2f33f3f3f8fbac33f3fb0db3f3fc0e13f3fcde03f3f3fd3eba2a53fe3e6
UTF-8 鴦꾆쇰뼕孼뽯쓬異녘쥈節뗭쭖沃왥욍럸嚥▲룗竊 111010011011010010100110111010101011111010000110111011001000011110110000111010111011110010010101111001011010110110111100111010111011110110101111111011001001001110101100111001111001010110110000111010111000010110011000111011001010010110001000111001111010111110000000111010111001011110101101111011001010110110010110111001101011001010000011111011001001100110100101111011001001101010001101111010111001111110111000111001011001101010100101111000101001011010110010111010111010001110010111111001111010101110001010 e9b4a6eabe86ec87b0ebbc95e5adbcebbdafec93ace795b0eb8598eca588e7af80eb97adecad96e6b283ec99a5ec9a8deb9fb8e59aa5e296b2eba397e7ab8a
UHC 鴦꾆쇰뼕孼뽯쓬異녘쥈節뗭쭖沃왥욍럸嚥▲룗竊 111001001110110010000100110011101011110011101011100101101001110111100101111011011001011011101011100111011000110011101100101101101011001111101000101000101000000111101111101111011000101111101100101001111000111011101000101010101001111011001110101111111110001110001110100101111110011010111111101000011110001110001111100100111110111110111100 e4ec84cebceb969de5ed96eb9d8cecb6b3e8a281efbd8beca78ee8aa9ecebfe38e97e6bfa1e38f93efbc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)