To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??紆?寃???垣?鬱大??紆?寃???垣?鬱垈^ 0011111100111111111000101111110000111111100110111000001100111111001111110011111110001010010111110011111110011111010101001001000111100101001111110011111111100010111111000011111110011011100000110011111100111111001111111000101001011111001111111001111101010100100110101011000001011110 3f3fe2fc3f9b833f3f3f8a5f3f9f5491e53f3fe2fc3f9b833f3f3f8a5f3f9f549ab05e
EUC-JP ??紆?寃???垣?鬱大??紆?寃???垣?鬱垈^ 0011111100111111111001001111111000111111110101011110001100111111001111110011111110110011110000000011111111011101101101011100001011100111001111110011111111100100111111100011111111010101111000110011111100111111001111111011001111000000001111111101110110110101110101001011001001011110 3f3fe4fe3fd5e33f3f3fb3c03fddb5c2e73f3fe4fe3fd5e33f3f3fb3c03fddb5d4b25e
UTF-8 亐렕紆렣寃닿렱렲垣렖鬱大亐렕紆렣寃닿렱렲垣렖鬱垈^ 11100100101110101001000011101011101000001001010111100111101101001000011011101011101000001010001111100101101011111000001111101011100010111011111111101011101000001011000111101011101000001011001011100101100111101010001111101011101000001001011011101001101011001011000111100101101001001010011111100100101110101001000011101011101000001001010111100111101101001000011011101011101000001010001111100101101011111000001111101011100010111011111111101011101000001011000111101011101000001011001011100101100111101010001111101011101000001001011011101001101011001011000111100101100111101000100001011110 e4ba90eba095e7b486eba0a3e5af83eb8bbfeba0b1eba0b2e59ea3eba096e9acb1e5a4a7e4ba90eba095e7b486eba0a3e5af83eb8bbfeba0b1eba0b2e59ea3eba096e9acb1e59e885e
UHC 亐렕紆렣寃닿렱렲垣렖鬱大亐렕紆렣寃닿렱렲垣렖鬱垈^ 11101010101001111000111010101010111010011110000110001110101101001110101010110010101101001110101010001110101111101000111010111111111010101010111110001110101010111110101010100110110100111101111011101010101001111000111010101010111010011110000110001110101101001110101010110010101101001110101010001110101111101000111010111111111010101010111110001110101010111110101010100110110100111101110001011110 eaa78eaae9e18eb4eab2b4ea8ebe8ebfeaaf8eabeaa6d3deeaa78eaae9e18eb4eab2b4ea8ebe8ebfeaaf8eabeaa6d3dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)