To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 霓??肄??醫??筌??有??蟻?ぞ檍??怡 11101000101111010011111100111111111000111110010100111111001111111110011111001110001111110011111111100010101000110011111100111111100101110100110000111111001111111000101101100001001111111000001010111100100111101111100000111111001111111001110001111101 e8bd3f3fe3e53f3fe7ce3f3fe2a33f3f974c3f3f8b613f82bc9ef83f3f9c7d
EUC-JP 霓??肄??醫??筌??有??蟻?ぞ檍??怡 11110000101111110011111100111111111001101110011100111111001111111110111011010000001111110011111111100100101001010011111100111111110011011010110100111111001111111011010111000010001111111010010010111110110111001111101000111111001111111101011111011110 f0bf3f3fe6e73f3feed03f3fe4a53f3fcdad3f3fb5c23fa4bedcfa3f3fd7de
UTF-8 霓얠떜肄덃룚醫묅뵺筌먯쉶有댐쬂蟻귣ぞ檍됤꽒怡 111010011001110010010011111011001001011010100000111010111001011010011100111010001000001010000100111010111000110110000011111010111010001110011010111010011000011010101011111010111010110010000101111010111011010110111010111001111010110110001100111010111010100010101111111011001000100110110110111001101001110010001001111010111000110010010000111011001010110010000010111010001001111110111011111010101011011110100011111000111000000110011110111001101010101010001101111010111001000010100100111010101011110110010010111001101000000010100001 e99c93ec96a0eb969ce88284eb8d83eba39ae986abebac85ebb5bae7ad8ceba8afec89b6e69c89eb8c90ecac82e89fbbeab7a3e3819ee6aa8deb90a4eabd92e680a1
UHC 霓얠떜肄덃룚醫묅뵺筌먯쉶有댐쬂蟻귣ぞ檍됤꽒怡 1110011111100111101111101110110010001011101100101110110010111101100010001110011010001111100101101110110010100010100100011110001010010100101110001110111110100111100100001110110010011010100011001110101011110011101101001110111110100110100110011110101111111100100000101110101110101010101111101110010111100101100010011110001010000100101000011110110010101110 e7e7beec8bb2ecbd88e68f96eca291e294b8efa790ec9a8ceaf3b4efa699ebfc82ebaabee5e589e284a1ecae

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
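
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the mapping from the charset labels to Python codec names (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) and the helper name bit_strings are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes seen in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings of text encoded with codec.

    Unencodable characters are replaced with "?" (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Print one row per charset for a sample character.
for label, codec in CHARSETS.items():
    binary, hexadecimal = bit_strings("あ", codec)
    print(label, binary, hexadecimal)
```

Using errors="replace" rather than the default strict mode keeps the loop from raising UnicodeEncodeError when a charset has no mapping for the input.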