To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 伍ュ????占 10001100110111101000001110000101001111110011111100111111001111111001000011101000 8cde83853f3f3f3f90e8
EUC-JP 伍ュ????占 10111000111000001010010111100101001111110011111100111111001111111100000011101010 b8e0a5e53f3f3f3fc0ea
UTF-8 伍ュ읆隸잒킊占 111001001011110010001101111000111000001110100101111011001001110110000110111011111010011010111000111011001001111010010010111011011000001010001010111001011000110110100000 e4bc8de383a5ec9d86efa6b8ec9e92ed828ae58da0
UHC 伍ュ읆隸잒킊占 1110011111101010101010111110010110011111101111001110011111100110100111111110100010110100100101101110111110111111 e7eaabe59fbce7e69fe8b496efbf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)