To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 訝??????ょ? 1110011001100010001111110011111100111111001111110011111100111111100000101110010100111111 e6623f3f3f3f3f3f82e53f
EUC-JP 訝??????ょ? 1110101111000011001111110011111100111111001111110011111100111111101001001110011100111111 ebc33f3f3f3f3f3fa4e73f
UTF-8 訝덌풛若잝떋銳ょ겮 111010001010100010011101111010111000110110001100111011011001001010011011111011111010010110110100111011001001111010011101111010111001011010001011111010011000101010110011111000111000001010000111111010101011001010101110 e8a89deb8d8ced929befa5b4ec9e9deb968be98ab3e38287eab2ae
UHC 訝덌풛若잝떋銳ょ겮 111001001011100010001000111011111011111010011110111001011010111010011111111011101000101110100001111001111110010110101010111001111000000110111100 e4b888efbe9ee5ae9fee8ba1e7e5aae781bc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)