What bit string does a character (or several characters) encode to in each character set?

Enter a single character or a short string and click "Convert".


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????u}????????u{^ 001111110011111100111111001111110011111100111111001111110011111101110101011111010011111100111111001111110011111100111111001111110011111100111111011101010111101101011110 3f3f3f3f3f3f3f3f757d3f3f3f3f3f3f3f3f757b5e
SJIS-WIN 旭??豆??絅?u}旭??豆??絅?u{^ 100010001010111000111111001111111001001110100100001111110011111111100011010001000011111101110101011111011000100010101110001111110011111110010011101001000011111100111111111000110100010000111111011101010111101101011110 88ae3f3f93a43f3fe3443f757d88ae3f3f93a43f3fe3443f757b5e
EUC-JP 旭?祜豆?饔絅?u}旭?祜豆?饔絅?u{^ 1011000010110000001111111000111111010000110110001100011010100110001111111000111111101000111011111110010110100101001111110111010101111101101100001011000000111111100011111101000011011000110001101010011000111111100011111110100011101111111001011010010100111111011101010111101101011110 b0b03f8fd0d8c6a63f8fe8efe5a53f757db0b03f8fd0d8c6a63f8fe8efe5a53f757b5e
UTF-8 旭렔祜豆쑹饔絅쑹u}旭렔祜豆쑹饔絅쑹u{^ 1110011010010111101011011110101110100000100101001110011110100101100111001110100010110001100001101110110010010001101110011110100110100101100101001110011110110101100001011110110010010001101110010111010101111101111001101001011110101101111010111010000010010100111001111010010110011100111010001011000110000110111011001001000110111001111010011010010110010100111001111011010110000101111011001001000110111001011101010111101101011110 e697adeba094e7a59ce8b186ec91b9e9a594e7b585ec91b9757de697adeba094e7a59ce8b186ec91b9e9a594e7b585ec91b9757b5e
UHC 旭렔祜豆쑹饔絅쑹u}旭렔祜豆쑹饔絅쑹u{^ 11101001111011111000111010101001111110111101010011010100111001111011111010101011111010001011110111001100111001111011111010101011011101010111110111101001111011111000111010101001111110111101010011010100111001111011111010101011111010001011110111001100111001111011111010101011011101010111101101011110 e9ef8ea9fbd4d4e7beabe8bdcce7beab757de9ef8ea9fbd4d4e7beabe8bdcce7beab757b5e
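The conversion shown in the table above can be approximated with a minimal Python sketch. The function name `to_bit_strings` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration, not the code behind this page. Characters that a charset cannot represent are replaced with "?" (0x3f), which resembles the ISO-8859-1 row, although the page's own replacement rules may differ.

```python
# Table labels mapped to Python codec names (the mapping is an assumption).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the codec cannot encode
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (binary, data.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in to_bit_strings("旭u}").items():
        print(label, binary, hexadecimal)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.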

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).