To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 鳶????ぐ倭??}鳶????ぐ倭??{^ 100100111100111000111111001111110011111100111111100000101010111010011000011000000011111100111111011111011001001111001110001111110011111100111111001111111000001010101110100110000110000000111111001111110111101101011110 93ce3f3f3f3f82ae98603f3f7d93ce3f3f3f3f82ae98603f3f7b5e
EUC-JP 鳶????ぐ倭??}鳶????ぐ倭??{^ 110001101101000000111111001111110011111100111111101001001011000011001111110000010011111100111111011111011100011011010000001111110011111100111111001111111010010010110000110011111100000100111111001111110111101101011110 c6d03f3f3f3fa4b0cfc13f3f7dc6d03f3f3f3fa4b0cfc13f3f7b5e
UTF-8 鳶면걶呂묋ぐ倭좄퀕}鳶면걶呂묋ぐ倭좄퀕{^ 111010011011001110110110111010111010100110110100111010101011000110110110111011111010011010000000111010111010110010001011111000111000000110010000111001011000000010101101111011001010001010000100111011011000000010010101011111011110100110110011101101101110101110101001101101001110101010110001101101101110111110100110100000001110101110101100100010111110001110000001100100001110010110000000101011011110110010100010100001001110110110000000100101010111101101011110 e9b3b6eba9b4eab1b6efa680ebac8be38190e580adeca284ed80957de9b3b6eba9b4eab1b6efa680ebac8be38190e580adeca284ed80957b5e
UHC 鳶면걶呂묋ぐ倭좄퀕}鳶면걶呂묋ぐ倭좄퀕{^ 111001101110100110111000111010011000000110011100111001011111101110010001111010001010101010110000111010001101111010100000111010001011001110001010011111011110011011101001101110001110100110000001100111001110010111111011100100011110100010101010101100001110100011011110101000001110100010110011100010100111101101011110 e6e9b8e9819ce5fb91e8aab0e8dea0e8b38a7de6e9b8e9819ce5fb91e8aab0e8dea0e8b38a7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)