To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????m?????????m^ 001111110011111100111111001111110011111100111111001111110011111100111111011011010011111100111111001111110011111100111111001111110011111100111111001111110110110101011110 3f3f3f3f3f3f3f3f3f6d3f3f3f3f3f3f3f3f3f6d5e
SJIS-WIN 遇????????m遇????????m^ 1000101111110110001111110011111100111111001111110011111100111111001111110011111101101101100010111111011000111111001111110011111100111111001111110011111100111111001111110110110101011110 8bf63f3f3f3f3f3f3f3f6d8bf63f3f3f3f3f3f3f3f6d5e
EUC-JP 遇?楗??????m遇?楗??????m^ 101101101111100000111111100011111100010011000110001111110011111100111111001111110011111100111111011011011011011011111000001111111000111111000100110001100011111100111111001111110011111100111111001111110110110101011110 b6f83f8fc4c63f3f3f3f3f3f6db6f83f8fc4c63f3f3f3f3f3f6d5e
UTF-8 遇띨楗泥렩裏흣렡렯m遇띨楗泥렩裏흣렡렯m^ 111010011000000110000111111010111001110110101000111001101010010110010111111011111010011110100011111010111010000010101001111011111010011110100111111011011001110110100011111010111010000010100001111010111010000010101111011011011110100110000001100001111110101110011101101010001110011010100101100101111110111110100111101000111110101110100000101010011110111110100111101001111110110110011101101000111110101110100000101000011110101110100000101011110110110101011110 e98187eb9da8e6a597efa7a3eba0a9efa7a7ed9da3eba0a1eba0af6de98187eb9da8e6a597efa7a3eba0a9efa7a7ed9da3eba0a1eba0af6d5e
UHC 遇띨楗泥렩裏흣렡렯m遇띨楗泥렩裏흣렡렯m^ 111010011110011110110110111011101100101111110001111011001011001010001110101101111110110011000000110010001110111010001110101100101000111010111100011011011110100111100111101101101110111011001011111100011110110010110010100011101011011111101100110000001100100011101110100011101011001010001110101111000110110101011110 e9e7b6eecbf1ecb28eb7ecc0c8ee8eb28ebc6de9e7b6eecbf1ecb28eb7ecc0c8ee8eb28ebc6d5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)