To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????z???????????zB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111101000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN 將?虞?除??虞?吟?z將?虞?除??虞?吟?zB 1001101110010010001111111000101111110001001111111000111110011100001111110011111110001011111100010011111110001011111000010011111101111010100110111001001000111111100010111111000100111111100011111001110000111111001111111000101111110001001111111000101111100001001111110111101001000010 9b923f8bf13f8f9c3f3f8bf13f8be13f7a9b923f8bf13f8f9c3f3f8bf13f8be13f7a42
EUC-JP 將?虞?除??虞?吟?z將?虞?除??虞?吟?zB 1101010111110010001111111011011011110011001111111011110111111100001111110011111110110110111100110011111110110110111000110011111101111010110101011111001000111111101101101111001100111111101111011111110000111111001111111011011011110011001111111011011011100011001111110111101001000010 d5f23fb6f33fbdfc3f3fb6f33fb6e33f7ad5f23fb6f33fbdfc3f3fb6f33fb6e33f7a42
UTF-8 將렚虞렧除곈ㄿ虞렧吟렞z將렚虞렧除곈ㄿ虞렧吟렞zB 111001011011000010000111111010111010000010011010111010001001100110011110111010111010000010100111111010011001100110100100111010101011001110001000111000111000010010111111111010001001100110011110111010111010000010100111111001011001000010011111111010111010000010011110011110101110010110110000100001111110101110100000100110101110100010011001100111101110101110100000101001111110100110011001101001001110101010110011100010001110001110000100101111111110100010011001100111101110101110100000101001111110010110010000100111111110101110100000100111100111101001000010 e5b087eba09ae8999eeba0a7e999a4eab388e384bfe8999eeba0a7e5909feba09e7ae5b087eba09ae8999eeba0a7e999a4eab388e384bfe8999eeba0a7e5909feba09e7a42
UHC 將렚虞렧除곈ㄿ虞렧吟렞z將렚虞렧除곈ㄿ虞렧吟렞zB 1110110111100010100011101010110111101001111001011000111010110110111100001011011010110000111010011010010010101111111010011110010110001110101101101110101111100001100011101010111101111010111011011110001010001110101011011110100111100101100011101011011011110000101101101011000011101001101001001010111111101001111001011000111010110110111010111110000110001110101011110111101001000010 ede28eade9e58eb6f0b6b0e9a4afe9e58eb6ebe18eaf7aede28eade9e58eb6f0b6b0e9a4afe9e58eb6ebe18eaf7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)