To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??D???????}??D???????{^ 0011111100111111010001000011111100111111001111110011111100111111001111110011111101111101001111110011111101000100001111110011111100111111001111110011111100111111001111110111101101011110 3f3f443f3f3f3f3f3f3f7d3f3f443f3f3f3f3f3f3f7b5e
SJIS-WIN 障?D吟?源?低逗工}障?D吟?源?低逗工{^ 1000111111100001001111110100010010001011111000010011111110001100101110010011111110010010111000011001000010000000100011010100100001111101100011111110000100111111010001001000101111100001001111111000110010111001001111111001001011100001100100001000000010001101010010000111101101011110 8fe13f448be13f8cb93f92e190808d487d8fe13f448be13f8cb93f92e190808d487b5e
EUC-JP 障?D吟?源?低逗工}障?D吟?源?低逗工{^ 1011111011100011001111110100010010110110111000110011111110111000101110110011111111000100111000111011111111100000101110011010100101111101101111101110001100111111010001001011011011100011001111111011100010111011001111111100010011100011101111111110000010111001101010010111101101011110 bee33f44b6e33fb8bb3fc4e3bfe0b9a97dbee33f44b6e33fb8bb3fc4e3bfe0b9a97b5e
UTF-8 障렚D吟렞源렰低逗工}障렚D吟렞源렰低逗工{^ 1110100110011010100111001110101110100000100110100100010011100101100100001001111111101011101000001001111011100110101110101001000011101011101000001011000011100100101111011000111011101001100000001001011111100101101101111010010101111101111010011001101010011100111010111010000010011010010001001110010110010000100111111110101110100000100111101110011010111010100100001110101110100000101100001110010010111101100011101110100110000000100101111110010110110111101001010111101101011110 e99a9ceba09a44e5909feba09ee6ba90eba0b0e4bd8ee98097e5b7a57de99a9ceba09a44e5909feba09ee6ba90eba0b0e4bd8ee98097e5b7a57b5e
UHC 障렚D吟렞源렰低逗工}障렚D吟렞源렰低逗工{^ 1110111010100001100011101010110101000100111010111110000110001110101011111110101010111001100011101011110111101110101110001101010011101000110011011110111101111101111011101010000110001110101011010100010011101011111000011000111010101111111010101011100110001110101111011110111010111000110101001110100011001101111011110111101101011110 eea18ead44ebe18eafeab98ebdeeb8d4e8cdef7deea18ead44ebe18eafeab98ebdeeb8d4e8cdef7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)