To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ???????? | 0011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f
SJIS-WIN | ?櫛?????? | 001111111000101111111001001111110011111100111111001111110011111100111111 | 3f8bf93f3f3f3f3f3f
EUC-JP | ?櫛?琁???? | 0011111110110110111110110011111110001111110011001010001100111111001111110011111100111111 | 3fb6fb3f8fcca33f3f3f3f
UTF-8 | 렱櫛렗琁렯렓렯렠 | 111010111010000010110001111001101010101110011011111010111010000010010111111001111001000010000001111010111010000010101111111010111010000010010011111010111010000010101111111010111010000010100000 | eba0b1e6ab9beba097e79081eba0afeba093eba0afeba0a0
UHC | 렱櫛렗琁렯렓렯렠 | 10001110101111101111000111101110100011101010110011100000110001001000111010111100100011101010100010001110101111001000111010110001 | 8ebef1ee8eace0c48ebc8ea88ebc8eb1

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent appears as "?" (0x3f) in the table.
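
The conversion shown in the table can be approximated in a few lines of Python. This is only a sketch: the tool's own implementation is not shown here, the function name `encode_table` is hypothetical, and the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption that may differ from the tool in edge cases. Characters a charset cannot represent are replaced with "?" (0x3f), as in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "latin-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_table(text, charsets=CHARSETS):
    """Return (label, character, binary string, hex string) for each charset."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        # Eight binary digits per encoded byte, concatenated.
        bits = "".join(f"{byte:08b}" for byte in data)
        # Decode again to show what survived the round trip.
        shown = data.decode(codec)
        rows.append((label, shown, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexstr in encode_table("櫛"):
        print(f"{label} | {shown} | {bits} | {hexstr}")
```

For the single character 櫛, this sketch yields e6ab9b for UTF-8, 8bf9 for SJIS-WIN, and b6fb for EUC-JP, which agrees with the corresponding bytes in the table, and 3f for ISO-8859-1.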