To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????^W????^Jn}????^W????^Jn{^ 0011111100111111001111110011111101011110010101110011111100111111001111110011111101011110010010100110111001111101001111110011111100111111001111110101111001010111001111110011111100111111001111110101111001001010011011100111101101011110 3f3f3f3f5e573f3f3f3f5e4a6e7d3f3f3f3f5e573f3f3f3f5e4a6e7b5e
SJIS-WIN 惟?障?^W惟?障?^Jn}惟?障?^W惟?障?^Jn{^ 10001000110100100011111110001111111000010011111101011110010101111000100011010010001111111000111111100001001111110101111001001010011011100111110110001000110100100011111110001111111000010011111101011110010101111000100011010010001111111000111111100001001111110101111001001010011011100111101101011110 88d23f8fe13f5e5788d23f8fe13f5e4a6e7d88d23f8fe13f5e5788d23f8fe13f5e4a6e7b5e
EUC-JP 惟?障?^W惟?障?^Jn}惟?障?^W惟?障?^Jn{^ 10110000110101000011111110111110111000110011111101011110010101111011000011010100001111111011111011100011001111110101111001001010011011100111110110110000110101000011111110111110111000110011111101011110010101111011000011010100001111111011111011100011001111110101111001001010011011100111101101011110 b0d43fbee33f5e57b0d43fbee33f5e4a6e7db0d43fbee33f5e57b0d43fbee33f5e4a6e7b5e
UTF-8 惟렋障렚^W惟렋障렚^Jn}惟렋障렚^W惟렋障렚^Jn{^ 11100110100000111001111111101011101000001000101111101001100110101001110011101011101000001001101001011110010101111110011010000011100111111110101110100000100010111110100110011010100111001110101110100000100110100101111001001010011011100111110111100110100000111001111111101011101000001000101111101001100110101001110011101011101000001001101001011110010101111110011010000011100111111110101110100000100010111110100110011010100111001110101110100000100110100101111001001010011011100111101101011110 e6839feba08be99a9ceba09a5e57e6839feba08be99a9ceba09a5e4a6e7de6839feba08be99a9ceba09a5e57e6839feba08be99a9ceba09a5e4a6e7b5e
UHC 惟렋障렚^W惟렋障렚^Jn}惟렋障렚^W惟렋障렚^Jn{^ 111010101110111010001110101000101110111010100001100011101010110101011110010101111110101011101110100011101010001011101110101000011000111010101101010111100100101001101110011111011110101011101110100011101010001011101110101000011000111010101101010111100101011111101010111011101000111010100010111011101010000110001110101011010101111001001010011011100111101101011110 eaee8ea2eea18ead5e57eaee8ea2eea18ead5e4a6e7deaee8ea2eea18ead5e57eaee8ea2eea18ead5e4a6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)