To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 竣褐黴虞?渟?汀?竣褐黴虞?渟?汀?^ 10001111011101101000101010001100111010101000000010001011111100010011111110011111110110010011111110010010111100110011111110001111011101101000101010001100111010101000000010001011111100010011111110011111110110010011111110010010111100110011111101011110 8f768a8cea808bf13f9fd93f92f33f8f768a8cea808bf13f9fd93f92f33f5e
EUC-JP 竣褐黴虞?渟?汀?竣褐黴虞?渟?汀?^ 10111101110101111011001111101100111100111110000010110110111100110011111111011110110110110011111111000100111101010011111110111101110101111011001111101100111100111110000010110110111100110011111111011110110110110011111111000100111101010011111101011110 bdd7b3ecf3e0b6f33fdedb3fc4f53fbdd7b3ecf3e0b6f33fdedb3fc4f53f5e
UTF-8 竣褐黴虞렦渟렪汀뒷竣褐黴虞렦渟렪汀뒬^ 11100111101010111010001111101000101001001001000011101001101110111011010011101000100110011001111011101011101000001010011011100110101110001001111111101011101000001010101011100110101100011000000011101011100100101011011111100111101010111010001111101000101001001001000011101001101110111011010011101000100110011001111011101011101000001010011011100110101110001001111111101011101000001010101011100110101100011000000011101011100100101010110001011110 e7aba3e8a490e9bbb4e8999eeba0a6e6b89feba0aae6b180eb92b7e7aba3e8a490e9bbb4e8999eeba0a6e6b89feba0aae6b180eb92ac5e
UHC 竣褐黴虞렦渟렪汀뒷竣褐黴虞렦渟렪汀뒬^ 11110001111000101100101011101000110110101011110111101001111001011000111010110101111011111110010110001110101110001110111111100010101101011101111011110001111000101100101011101000110110101011110111101001111001011000111010110101111011111110010110001110101110001110111111100010101101011101110001011110 f1e2cae8dabde9e58eb5efe58eb8efe2b5def1e2cae8dabde9e58eb5efe58eb8efe2b5dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)