To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 閼ア螟夊∮隹キ蜈カ蟄ォ隹キ蟾ス螻櫁┳螟夊∮ 111010001000010010110001111001011010010010011010111010001000011110010011111010001011000010110111111001011000010110110110111001011010110110101011111010001011000010110111111001011011011110111101111001011011000110011110111010001000010010110001111001011010010010011010111010001000011110010011 e884b1e5a49ae88793e8b0b7e585b6e5adabe8b0b7e5b7bde5b19ee884b1e5a49ae88793
EUC-JP 閼ア螟夊?隹キ蜈カ蟄ォ隹キ蟾ス螻櫁┳螟夊? 11101111111001001000111010110001111010101010011011010100111010100011111111110000101100101000111010110111111010011110010110001110101101101110101010101111100011101010101111110000101100101000111010110111111010101011100110001110101111011110101010110011110111001110101010101000101100111110101010100110110101001110101000111111 efe48eb1eaa6d4ea3ff0b28eb7e9e58eb6eaaf8eabf0b28eb7eab98ebdeab3dceaa8b3eaa6d4ea3f
UTF-8 閼ア螟夊∮隹キ蜈カ蟄ォ隹キ蟾ス螻櫁┳螟夊∮ 111010011001011010111100111011111011110110110001111010001001111010011111111001011010010010001010111000101000100010101110111010011001101010111001111011111011110110110111111010001001110010001000111011111011110110110110111010001001111110000100111011111011110110101011111010011001101010111001111011111011110110110111111010001001111110111110111011111011110110111101111010001001111010111011111001101010101110000001111000101001010010110011111010001001111010011111111001011010010010001010111000101000100010101110 e996bcefbdb1e89e9fe5a48ae288aee99ab9efbdb7e89c88efbdb6e89f84efbdabe99ab9efbdb7e89fbeefbdbde89ebbe6ab81e294b3e89e9fe5a48ae288ae
UHC 閼?螟?∮??蜈?蟄???蟾???┳螟?∮ 111001001101100100111111110110011010110100111111101000101011000100111111001111111110100010100101001111111111011011011110001111110011111100111111111000001110101000111111001111110011111110100110101100111101100110101101001111111010001010110001 e4d93fd9ad3fa2b13f3fe8a53ff6de3f3f3fe0ea3f3f3fa6b3d9ad3fa2b1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)