To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 畑??溢o????域????┐誘??暗 1001010010101000001111110011111110001000111011001000001010001111001111110011111100111111001111111000100011100110001111110011111100111111001111111000010010100010100101110101010100111111001111111000100011000011 94a83f3f88ec828f3f3f3f3f88e63f3f3f3f84a297553f3f88c3
EUC-JP 畑??溢o????域??馹?┐誘??暗 11001000101010100011111100111111101100001110111010100011111011110011111100111111001111110011111110110000111010000011111100111111100011111110100110100001001111111010100010100100110011011011011000111111001111111011000011000101 c8aa3f3fb0eea3ef3f3f3f3fb0e83f3f8fe9a13fa8a4cdb63f3fb0c5
UTF-8 畑듬뿰溢o쭏琉밸뙀域뱀룆馹깍┐誘↔덱暗 111001111001010110010001111010111001001110101100111010111011111110110000111001101011101010100010111011111011110110001111111011001010110110001111111011111010011110001100111010111011000010111000111010111001100110000000111001011001111110011111111010111011000110000000111010111010001110000110111010011010011010111001111010101011100110001101111000101001010010010000111010001010101010011000111000101000011010010100111010111000110110110001111001101001101010010111 e79591eb93acebbfb0e6baa2efbd8fecad8fefa78cebb0b8eb9980e59f9febb180eba386e9a6b9eab98de29490e8aa98e28694eb8db1e69a97
UHC 畑듬뿰溢o쭏琉밸뙀域뱀룆馹깍┐誘↔덱暗 1110111110100101101101011110101110010111101100001110110011101110101000111110111110100111100010001110101110100100101110011110101110001100100001101110011010110100101110011110110010001111100001011110110011110001101100011110111110100110101001001110101110101111101000011110101010110101101001101110010011011110 efa5b5eb97b0eceea3efa788eba4b9eb8c86e6b4b9ec8f85ecf1b1efa6a4ebafa1eab5a6e4de

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)