To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲μ?肄?癲μ?肄 111000011001111110000011110010100011111111100011111001010011111111100001100111111000001111001010001111111110001111100101 e19f83ca3fe3e53fe19f83ca3fe3e5
EUC-JP 癲μ?肄?癲μ?肄 111000101010000110100110110011000011111111100110111001110011111111100010101000011010011011001100001111111110011011100111 e2a1a6cc3fe6e73fe2a1a6cc3fe6e7
UTF-8 癲μ렲肄뻞癲μ렲肄 11100111100110011011001011001110101111001110101110100000101100101110100010000010100001001110101110111011100111101110011110011001101100101100111010111100111010111010000010110010111010001000001010000100 e799b2cebceba0b2e88284ebbb9ee799b2cebceba0b2e88284
UHC 癲μ렲肄뻞癲μ렲肄 111011111010011010100101111011001000111010111111111011001011110110010110011010001110111110100110101001011110110010001110101111111110110010111101 efa6a5ec8ebfecbd9668efa6a5ec8ebfecbd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)