To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?氓????夭!雰?氓????夭!雰B 001111111001111110000010001111110011111100111111001111111001101011101110100000010100100110010101101101010011111110011111100000100011111100111111001111110011111110011010111011101000000101001001100101011011010101000010 3f9f823f3f3f3f9aee814995b53f9f823f3f3f3f9aee814995b542
EUC-JP ?氓????夭!雰?氓????夭!雰B 001111111101110111100010001111110011111100111111001111111101010011110000101000011010101011001010101101110011111111011101111000100011111100111111001111110011111111010100111100001010000110101010110010101011011101000010 3fdde23f3f3f3fd4f0a1aacab73fdde23f3f3f3fd4f0a1aacab742
UTF-8 뤗氓댈쫷빅넜夭!雰뤗氓댈쫷빅넜夭!雰B 11101011101001001001011111100110101100001001001111101011100011001000100011101100101010111011011111101011101110011000010111101011100001001001110011100101101001001010110111101111101111001000000111101001100110111011000011101011101001001001011111100110101100001001001111101011100011001000100011101100101010111011011111101011101110011000010111101011100001001001110011100101101001001010110111101111101111001000000111101001100110111011000001000010 eba497e6b093eb8c88ecabb7ebb985eb849ce5a4adefbc81e99bb0eba497e6b093eb8c88ecabb7ebb985eb849ce5a4adefbc81e99bb042
UHC 뤗氓댈쫷빅넜夭!雰뤗氓댈쫷빅넜夭!雰B 10001111110001111101100011101100101101001110111010100110100011101011101011110010101100111101010011101000111011001010001110100001110111011101010010001111110001111101100011101100101101001110111010100110100011101011101011110010101100111101010011101000111011001010001110100001110111011101010001000010 8fc7d8ecb4eea68ebaf2b3d4e8eca3a1ddd48fc7d8ecb4eea68ebaf2b3d4e8eca3a1ddd442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)