To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?る?毅??儒??鵝??援??飮??窈???? 001111111000001011101001001111111000101101000010001111110011111110001110111100100011111100111111111010100100000000111111001111111000100110000111001111110011111110011111010110100011111100111111111000100111011100111111001111110011111100111111 3f82e93f8b423f3f8ef23f3fea403f3f89873f3f9f5a3f3fe2773f3f3f3f
EUC-JP ?る?毅??儒??鵝??援??飮??窈???? 001111111010010011101011001111111011010110100011001111110011111110111100111101000011111100111111111100111010000100111111001111111011000111100111001111110011111111011101101110110011111100111111111000111101100000111111001111110011111100111111 3fa4eb3fb5a33f3fbcf43f3ff3a13f3fb1e73f3fddbb3f3fe3d83f3f3f3f
UTF-8 閭る벡毅볣윀儒띠젂鵝싰퉩援꿱읅飮뉕콨窈띾맪柳췇 111011111010011010000110111000111000001010001011111010111011001010100001111001101010111110000101111010111011001110100011111011001001110010000000111001011000010010010010111010111001110110100000111011001010000010000010111010011011010110011101111011001000101110110000111011011000100110101001111001101000111110110100111010101011111110110001111011001001110110000101111010011010001110101110111010111000100110010101111011001011110110101000111001111010101010001000111010111001110110111110111010111010011110101010111011111010011110001001111011001011011110000111 efa686e3828bebb2a1e6af85ebb3a3ec9c80e58492eb9da0eca082e9b59dec8bb0ed89a9e68fb4eabfb1ec9d85e9a3aeeb8995ecbda8e7aa88eb9dbeeba7aaefa789ecb787
UHC 閭る벡毅볣윀儒띠젂鵝싰퉩援꿱읅飮뉕콨窈띾맪柳췇 11100110101011011010101011101011101110101010010011101011111101101001001111101001100111111000101111101010111000111011011011101100101000001000011011100100101111011001101011101010101110011000000111101010101101011011001011101000100111111011101111101011111001101000011111101010101100011001110111101001101000011000110111101011100100001011001011101010111101111010111001000010 e6adaaebbaa4ebf693e99f8beae3b6eca086e4bd9aeab981eab5b2e89fbbebe687eab19de9a18deb90b2eaf7ae42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)