To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣??臾??邀 00111111001111110011111110001011100000110011111100111111111001000110101100111111001111111110011110110001 3f3f3f8b833f3fe46b3f3fe7b1
EUC-JP 艅??泣??臾??邀 100011111101011011111101001111110011111110110101111000110011111100111111111001111100110000111111001111111110111010110011 8fd6fd3f3fb5e33f3fe7cc3f3feeb3
UTF-8 艅덈냵泣쎿쓱臾덈룺邀 111010001000100110000101111010111000110110001000111010111000001110110101111001101011001110100011111011001000111010111111111011001001001110110001111010001000011110111110111010111000110110001000111010111010001110111010111010011000001010000000 e88985eb8d88eb83b5e6b3a3ec8ebfec93b1e887beeb8d88eba3bae98280
UHC 艅덈냵泣쎿쓱臾덈룺邀 1110011010101001100010001110101110000110100001011110101111101000100110111110011010111110101100111110101110101100100010001110101110001111101011011110100110101101 e6a988eb8685ebe89be6beb3ebac88eb8fade9ad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)