To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 訂葛??彧?蠢?砥?瓦?訂葛??彧?蠢?砥?瓦?^ 10010010111110011000101010001011001111110011111111111010101110010011111111100101101111110011111110010011011101010011111110001010101000100011111110010010111110011000101010001011001111110011111111111010101110010011111111100101101111110011111110010011011101010011111110001010101000100011111101011110 92f98a8b3f3ffab93fe5bf3f93753f8aa23f92f98a8b3f3ffab93fe5bf3f93753f8aa23f5e
EUC-JP 訂葛??彧?蠢?砥?瓦?訂葛??彧?蠢?砥?瓦?^ 110001001111101110110011111010110011111100111111100011111011110011111110001111111110101011000001001111111100010111010110001111111011010010100100001111111100010011111011101100111110101100111111001111111000111110111100111111100011111111101010110000010011111111000101110101100011111110110100101001000011111101011110 c4fbb3eb3f3f8fbcfe3feac13fc5d63fb4a43fc4fbb3eb3f3f8fbcfe3feac13fc5d63fb4a43f5e
UTF-8 訂葛렢렓彧렢蠢렎砥렫瓦쌨訂葛렢렓彧렢蠢렎砥렫瓦쌤^ 11101000101010001000001011101000100100011001101111101011101000001010001011101011101000001001001111100101101111011010011111101011101000001010001011101000101000001010001011101011101000001000111011100111101000001010010111101011101000001010101111100111100100111010011011101100100011001010100011101000101010001000001011101000100100011001101111101011101000001010001011101011101000001001001111100101101111011010011111101011101000001010001011101000101000001010001011101011101000001000111011100111101000001010010111101011101000001010101111100111100100111010011011101100100011001010010001011110 e8a882e8919beba0a2eba093e5bda7eba0a2e8a0a2eba08ee7a0a5eba0abe793a6ec8ca8e8a882e8919beba0a2eba093e5bda7eba0a2e8a0a2eba08ee7a0a5eba0abe793a6ec8ca45e
UHC 訂葛렢렓彧렢蠢렎砥렫瓦쌨訂葛렢렓彧렢蠢렎砥렫瓦쌤^ 11101111111101001100101011100111100011101011001110001110101010001110100111101110100011101011001111110001111000111000111010100100111100101011001010001110101110011110100010111111101111011101111011101111111101001100101011100111100011101011001110001110101010001110100111101110100011101011001111110001111000111000111010100100111100101011001010001110101110011110100010111111101111011101110001011110 eff4cae78eb38ea8e9ee8eb3f1e38ea4f2b28eb9e8bfbddeeff4cae78eb38ea8e9ee8eb3f1e38ea4f2b28eb9e8bfbddc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)