What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 而孟??毅冪?磐?峰?孟??毅冪?磐?六 10001110101001111001011011010000001111110011111110001011010000101001100101110000001111111001010011010110001111111001010111110100001111111001011011010000001111110011111110001011010000101001100101110000001111111001010011010110001111111001100001011010 8ea796d03f3f8b4299703f94d63f95f43f96d03f3f8b4299703f94d63f985a
EUC-JP 而孟??毅冪?磐?峰?孟??毅冪?磐?六 10111100101010011100110011010010001111110011111110110101101000111101000111010001001111111100100011011000001111111100101011110110001111111100110011010010001111110011111110110101101000111101000111010001001111111100100011011000001111111100111110111011 bca9ccd23f3fb5a3d1d13fc8d83fcaf63fccd23f3fb5a3d1d13fc8d83fcfbb
UTF-8 而孟렫렲毅冪렱磐렱峰렫孟렫렲毅冪렱磐렱六 111010001000000010001100111001011010110110011111111010111010000010101011111010111010000010110010111001101010111110000101111001011000011010101010111010111010000010110001111001111010001110010000111010111010000010110001111001011011001110110000111010111010000010101011111001011010110110011111111010111010000010101011111010111010000010110010111001101010111110000101111001011000011010101010111010111010000010110001111001111010001110010000111010111010000010110001111001011000010110101101 e8808ce5ad9feba0abeba0b2e6af85e586aaeba0b1e7a390eba0b1e5b3b0eba0abe5ad9feba0abeba0b2e6af85e586aaeba0b1e7a390eba0b1e585ad
UHC 而孟렫렲毅冪렱磐렱峰렫孟렫렲毅冪렱磐렱六 11101100101110111101100011101011100011101011100110001110101111111110101111110110110110001111000110001110101111101101101011110001100011101011111011011100111010001000111010111001110110001110101110001110101110011000111010111111111010111111011011011000111100011000111010111110110110101111000110001110101111101101011110111111 ecbbd8eb8eb98ebfebf6d8f18ebedaf18ebedce88eb9d8eb8eb98ebfebf6d8f18ebedaf18ebed7bf

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
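
The conversion shown in the table can be approximated with a short Python sketch. This is an illustration under stated assumptions, not the tool's actual implementation: the function name `to_bit_strings` is hypothetical, and the mapping from the charset labels above to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) is an assumption that may differ from the tool in edge cases. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` bytes in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# These names are assumptions; the tool itself may use slightly different tables.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for `text` encoded with `codec`.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("六", codec)
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight bits, so the binary string is always four times as long as the hexadecimal one.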