Which bit string is a character (or short string) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嗚????ぜ???筌??誼??蹂λ????? 10011010011010100011111100111111001111110011111110000010101110100011111100111111001111111110001010100011001111110011111110001011011000100011111100111111111001101111100010000011110010010011111100111111001111110011111100111111 9a6a3f3f3f3f82ba3f3f3fe2a33f3f8b623f3fe6f883c93f3f3f3f3f
EUC-JP 嗚????ぜ洧??筌??誼??蹂λ?孼??彛 11010011110010110011111100111111001111110011111110100100101111001000111111000111101101000011111100111111111001001010010100111111001111111011010111000011001111110011111111101100111110101010011011001011001111111000111110111010110000110011111100111111100011111011110011111010 d3cb3f3f3f3fa4bc8fc7b43f3fe4a53f3fb5c33f3fecfaa6cb3f8fbac33f3f8fbcfa
UTF-8 嗚삳챶栒뤺ぜ洧몃쎗筌뤾퍔誼잌꽱蹂λ렰孼닿쒀彛 1110010110010111100110101110110010000010101100111110110010110001101101101110011010100000100100101110101110100100101110101110001110000001100111001110011010110100101001111110101110101010100000111110110010001110100101111110011110101101100011001110101110100100101111101110110110001101100101001110100010101010101111001110110010011110100011001110101010111101101100011110100010111001100000101100111010111011111010111010000010110000111001011010110110111100111010111000101110111111111011001001001010000000111001011011110110011011 e5979aec82b3ecb1b6e6a092eba4bae3819ce6b4a7ebaa83ec8e97e7ad8ceba4beed8d94e8aabcec9e8ceabdb1e8b982cebbeba0b0e5adbceb8bbfec9280e5bd9b
UHC 嗚삳챶栒뤺ぜ洧몃쎗筌뤾퍔誼잌꽱蹂λ렰孼닿쒀彛 1110011111110000101110111110101110101010100000111110001011100011100011111110100010101010101111001110101011111011101110001110101110011011101111101110111110100111100011111110101010111011100010111110101111111110100111111110010110000100101111001110101110110011101001011110101110001110101111011110010111101101101101001110101010111110101011001110110010101101 e7f0bbebaa83e2e38fe8aabceafbb8eb9bbeefa78feabb8bebfe9fe584bcebb3a5eb8ebde5edb4eabeacecad

SJIS-Win, EUC-JP: Classic charsets mainly used for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
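
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page: the function name `to_bit_strings` is hypothetical, and the Python codec names (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be close equivalents of the charsets listed above, so results for rare characters may differ. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the runs of 3f in the table.

```python
def to_bit_strings(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Encode text in each charset and return {charset: (binary, hexadecimal)}.

    Hypothetical helper for illustration; codec names are Python's, assumed
    to correspond to ISO-8859-1, SJIS-Win, EUC-JP, UTF-8, and UHC.
    """
    rows = {}
    for charset in charsets:
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(charset, errors="replace")
        # Render every byte as exactly 8 bits and concatenate them.
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[charset] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for charset, (binary, hexadecimal) in to_bit_strings("ぜ").items():
        print(charset, binary, hexadecimal)
```

For the hiragana character ぜ, this yields 82ba in CP932, a4bc in EUC-JP, and e3819c in UTF-8, matching the byte sequences for that character in the table, while ISO-8859-1 falls back to 3f.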