To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣??膺??永 00111111001111110011111110001011100000110011111100111111111001000101111000111111001111111000100101101001 3f3f3f8b833f3fe45e3f3f8969
EUC-JP 艅??泣??膺??永 100011111101011011111101001111110011111110110101111000110011111100111111111001111011111100111111001111111011000111001010 8fd6fd3f3fb5e33f3fe7bf3f3fb1ca
UTF-8 艅덈쵓泣덅쫨膺삳떭永 111010001000100110000101111010111000110110001000111011001011010110010011111001101011001110100011111010111000110110000101111011001010101110101000111010001000011010111010111011001000001010110011111010111001011010101101111001101011000010111000 e88985eb8d88ecb593e6b3a3eb8d85ecaba8e886baec82b3eb96ade6b0b8
UHC 艅덈쵓泣덅쫨膺삳떭永 1110011010101001100010001110101110101100100101011110101111101000100010001110100010100110100000011110101111101100101110111110101110001011101111011110011110110101 e6a988ebac95ebe888e8a681ebecbbeb8bbde7b5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)