To which bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 闇μ???闇μ???^ 100010001100010110000011110010100011111100111111001111111000100011000101100000111100101000111111001111110011111101011110 88c583ca3f3f3f88c583ca3f3f3f5e
EUC-JP 闇μ???闇μ???^ 101100001100011110100110110011000011111100111111001111111011000011000111101001101100110000111111001111110011111101011110 b0c7a6cc3f3f3fb0c7a6cc3f3f3f5e
UTF-8 闇μ늺六뾆闇μ늺六뾆^ 1110100110010111100001111100111010111100111010111000101010111010111011111010011110010001111010111011111010000110111010011001011110000111110011101011110011101011100010101011101011101111101001111001000111101011101111101000011001011110 e99787cebceb8abaefa791ebbe86e99787cebceb8abaefa791ebbe865e
UHC 闇μ늺六뾆闇μ늺六뾆^ 111001001110000110100101111011001000100010000011111010111011101110010111010001001110010011100001101001011110110010001000100000111110101110111011100101110100010001011110 e4e1a5ec8883ebbb9744e4e1a5ec8883ebbb97445e
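
In the table above, each "?" in the Character column corresponds to the byte 3f: the character has no mapping in that charset and was replaced by a question mark. A minimal Python sketch of the same kind of conversion follows. The codec names (cp932 for SJIS-WIN, cp949 for UHC) and the helper to_bit_strings are assumptions made for illustration, not the converter's actual implementation, and Python's codecs may differ from the converter for some characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Characters the charset cannot represent are replaced by "?" (byte 3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    # Print one row per charset for a short sample string.
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("闇μ^", codec)
        print(label, binary, hexadecimal)
```

For example, to_bit_strings("^", "utf-8") returns ("01011110", "5e"), matching the final byte of every row in the table.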

SJIS-WIN, EUC-JP: Legacy charsets used mainly for encoding Japanese text on Windows (SJIS-WIN, i.e. CP932) and on UNIX (EUC-JP).