To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?х????焉??松? 0011111110000100100001110011111100111111001111110011111111100000100000010011111100111111100011111011110000111111 3f84873f3f3f3fe0813f3f8fbc3f
EUC-JP ?х????焉??松? 0011111110100111111001110011111100111111001111110011111111011111111000010011111100111111101111101011111000111111 3fa7e73f3f3f3fdfe13f3fbebe3f
UTF-8 寧х쭅溜뗫젚焉뉙럺松쳿 1110111110100110101010101101000110000101111011001010110110000101111011111010011110001011111010111001011110101011111011001010000010011010111001111000010010001001111010111000100110011001111010111001111110111010111001101001110110111110111011001011001110111111 efa6aad185ecad85efa78beb97abeca09ae78489eb8999eb9fbae69dbeecb3bf
UHC 寧х쭅溜뗫젚焉뉙럺松쳿 11100111101011001010110011100111101001111000000111101010111111101000101111101011101000001001011011100101111010101000011111101101100011101001100111100001111001101010110001000010 e7acace7a781eafe8beba096e5ea87ed8e99e1e6ac42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)