To what bit string is a character (or short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 艶k?狎??闇??押 100010011001000010000010100010110011111111100000101111100011111100111111100010001100010100111111001111111000100110011111 8990828b3fe0be3f3f88c53f3f899f
EUC-JP 艶k?狎??闇??押 101100011111000010100011111010110011111111100000110000000011111100111111101100001100011100111111001111111011001010100001 b1f0a3eb3fe0c03f3fb0c73f3fb2a1
UTF-8 艶k졁狎띾㈀闇낂슌押 111010001000100110110110111011111011110110001011111011001010000110000001111001111000101110001110111010111001110110111110111000111000100010000000111010011001011110000111111010111000001010000010111011001000101010001100111001101000101010111100 e889b6efbd8beca181e78b8eeb9dbee38880e99787eb8282ec8a8ce68abc
UHC 艶k졁狎띾㈀闇낂슌押 1110011011111101101000111110101110100000101100101110010011100100100011011110101110101001101100011110010011100001100001011110100110011010100111001110010011100011 e6fda3eba0b2e4e48deba9b1e4e185e99a9ce4e3
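A conversion like the one tabulated above can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. It assumes that Python's `cp932` codec corresponds to SJIS-WIN and `cp949` to UHC, and that unencodable characters are replaced with "?" (0x3f), as the ISO-8859-1 row suggests. The names `CHARSETS`, `encode_row`, and `convert` are hypothetical.

```python
# Sketch: encode a string in several charsets and show the bytes as bits and hex.
# Assumption: Python codec names stand in for the charset labels used in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumed equivalent of SJIS-WIN (CP932)
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed equivalent of UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # "\u8276" is the first character of the sample input shown in the table.
    for name, (bits, hexa) in convert("\u8276").items():
        print(name, bits, hexa)
```

Codec tables can differ slightly between implementations, so individual characters may encode differently here than on this page.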

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).