What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淨?垣?鬱豆??衣私??淨?垣?鬱豆??衣私??^ 10011111110001000011111110001010010111110011111110011111010101001001001110100100001111110011111110001000110111111000111010000100001111110011111110011111110001000011111110001010010111110011111110011111010101001001001110100100001111110011111110001000110111111000111010000100001111110011111101011110 9fc43f8a5f3f9f5493a43f3f88df8e843f3f9fc43f8a5f3f9f5493a43f3f88df8e843f3f5e
EUC-JP 淨?垣?鬱豆??衣私??淨?垣?鬱豆??衣私??^ 11011110110001100011111110110011110000000011111111011101101101011100011010100110001111110011111110110000111000011011101111100100001111110011111111011110110001100011111110110011110000000011111111011101101101011100011010100110001111110011111110110000111000011011101111100100001111110011111101011110 dec63fb3c03fddb5c6a63f3fb0e1bbe43f3fdec63fb3c03fddb5c6a63f3fb0e1bbe43f3f5e
UTF-8 淨렠垣렖鬱豆렩렰衣私렟넸淨렠垣렖鬱豆렩렰衣私렟넵^ 11100110101101111010100011101011101000001010000011100101100111101010001111101011101000001001011011101001101011001011000111101000101100011000011011101011101000001010100111101011101000001011000011101000101000011010001111100111101001111000000111101011101000001001111111101011100001001011100011100110101101111010100011101011101000001010000011100101100111101010001111101011101000001001011011101001101011001011000111101000101100011000011011101011101000001010100111101011101000001011000011101000101000011010001111100111101001111000000111101011101000001001111111101011100001001011010101011110 e6b7a8eba0a0e59ea3eba096e9acb1e8b186eba0a9eba0b0e8a1a3e7a781eba09feb84b8e6b7a8eba0a0e59ea3eba096e9acb1e8b186eba0a9eba0b0e8a1a3e7a781eba09feb84b55e
UHC 淨렠垣렖鬱豆렩렰衣私렟넸淨렠垣렖鬱豆렩렰衣私렟넵^ 11101111111001001000111010110001111010101010111110001110101010111110101010100110110101001110011110001110101101111000111010111101111010111111110111011110111001111000111010110000101100111101111011101111111001001000111010110001111010101010111110001110101010111110101010100110110101001110011110001110101101111000111010111101111010111111110111011110111001111000111010110000101100111101110001011110 efe48eb1eaaf8eabeaa6d4e78eb78ebdebfddee78eb0b3deefe48eb1eaaf8eabeaa6d4e78eb78ebdebfddee78eb0b3dc5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
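
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumed to be the Python equivalents of the charsets listed above, and `encode_row` and `convert` are hypothetical helper names. Characters that a charset cannot represent are replaced with "?" (hex 3f), which appears to be what the table does as well.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",   # CP932, the Windows variant of Shift_JIS
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become "?" (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}
```

For example, `convert("^")["UTF-8"]` returns `("01011110", "5e")`, matching the final byte of every row in the table.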