To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?M?p]nf?M?p]n^}Y?M?p]nf?M?p]n^}bE 001111110100110100111111011100000101110101101110011001100011111101001101001111110111000001011101011011100101111001111101010110010011111101001101001111110111000001011101011011100110011000111111010011010011111101110000010111010110111001011110011111010110001001000101 3f4d3f705d6e663f4d3f705d6e5e7d593f4d3f705d6e663f4d3f705d6e5e7d6245
SJIS-WIN 癌M?p]nf癌M?p]n^}Y癌M?p]nf癌M?p]n^}bE 10001010111000000100110100111111011100000101110101101110011001101000101011100000010011010011111101110000010111010110111001011110011111010101100110001010111000000100110100111111011100000101110101101110011001101000101011100000010011010011111101110000010111010110111001011110011111010110001001000101 8ae04d3f705d6e668ae04d3f705d6e5e7d598ae04d3f705d6e668ae04d3f705d6e5e7d6245
EUC-JP 癌M?p]nf癌M?p]n^}Y癌M?p]nf癌M?p]n^}bE 10110100111000100100110100111111011100000101110101101110011001101011010011100010010011010011111101110000010111010110111001011110011111010101100110110100111000100100110100111111011100000101110101101110011001101011010011100010010011010011111101110000010111010110111001011110011111010110001001000101 b4e24d3f705d6e66b4e24d3f705d6e5e7d59b4e24d3f705d6e66b4e24d3f705d6e5e7d6245
UTF-8 癌M卨p]nf癌M卨p]n^}Y癌M卨p]nf癌M卨p]n^}bE 11100111100110011000110001001101111001011000110110101000011100000101110101101110011001101110011110011001100011000100110111100101100011011010100001110000010111010110111001011110011111010101100111100111100110011000110001001101111001011000110110101000011100000101110101101110011001101110011110011001100011000100110111100101100011011010100001110000010111010110111001011110011111010110001001000101 e7998c4de58da8705d6e66e7998c4de58da8705d6e5e7d59e7998c4de58da8705d6e66e7998c4de58da8705d6e5e7d6245
UHC 癌M卨p]nf癌M卨p]n^}Y癌M卨p]nf癌M卨p]n^}bE 1110010011011111010011011110000011011001011100000101110101101110011001101110010011011111010011011110000011011001011100000101110101101110010111100111110101011001111001001101111101001101111000001101100101110000010111010110111001100110111001001101111101001101111000001101100101110000010111010110111001011110011111010110001001000101 e4df4de0d9705d6e66e4df4de0d9705d6e5e7d59e4df4de0d9705d6e66e4df4de0d9705d6e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)