To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????W}???????????W{^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010111011111010011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 淨朧?楮?┳樗??樗?W}淨朧?楮?┳樗??樗?W{^ 100111111100010010011110010011110011111110011110101110000011111110000100101100011001001010010100001111110011111110010010100101000011111101010111011111011001111111000100100111100100111100111111100111101011100000111111100001001011000110010010100101000011111100111111100100101001010000111111010101110111101101011110 9fc49e4f3f9eb83f84b192943f3f92943f577d9fc49e4f3f9eb83f84b192943f3f92943f577b5e
EUC-JP 淨朧?楮?┳樗??樗?W}淨朧?楮?┳樗??樗?W{^ 110111101100011011011011101100000011111111011100101110100011111110101000101100111100001111110100001111110011111111000011111101000011111101010111011111011101111011000110110110111011000000111111110111001011101000111111101010001011001111000011111101000011111100111111110000111111010000111111010101110111101101011110 dec6dbb03fdcba3fa8b3c3f43f3fc3f43f577ddec6dbb03fdcba3fa8b3c3f43f3fc3f43f577b5e
UTF-8 淨朧떫楮잴┳樗흘⊙樗렡W}淨朧떫楮잴┳樗흘⊙樗렡W{^ 1110011010110111101010001110011010011100101001111110101110010110101010111110011010100101101011101110110010011110101101001110001010010100101100111110011010101000100101111110110110011101100110001110001010001010100110011110011010101000100101111110101110100000101000010101011101111101111001101011011110101000111001101001110010100111111010111001011010101011111001101010010110101110111011001001111010110100111000101001010010110011111001101010100010010111111011011001110110011000111000101000101010011001111001101010100010010111111010111010000010100001010101110111101101011110 e6b7a8e69ca7eb96abe6a5aeec9eb4e294b3e6a897ed9d98e28a99e6a897eba0a1577de6b7a8e69ca7eb96abe6a5aeec9eb4e294b3e6a897ed9d98e28a99e6a897eba0a1577b5e
UHC 淨朧떫楮잴┳樗흘⊙樗렡W}淨朧떫楮잴┳樗흘⊙樗렡W{^ 11101111111001001101011011101000101101101011010111101110101111111100000011101010101001101011001111101110110000001100100011101010101000101100000111101110110000001000111010110010010101110111110111101111111001001101011011101000101101101011010111101110101111111100000011101010101001101011001111101110110000001100100011101010101000101100000111101110110000001000111010110010010101110111101101011110 efe4d6e8b6b5eebfc0eaa6b3eec0c8eaa2c1eec08eb2577defe4d6e8b6b5eebfc0eaa6b3eec0c8eaa2c1eec08eb2577b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)