To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??泣ワ??κ?壓??唯?? 111000101010001100111111001111111000101110000011100000111000111100111111001111111000001111001000001111111001101011011000001111110011111110010111010000100011111100111111 e2a33f3f8b83838f3f3f83c83f9ad83f3f97423f3f
EUC-JP 筌??泣ワ?洹κ?壓??唯?? 1110010010100101001111110011111110110101111000111010010111101111001111111000111111000111101110101010011011001010001111111101010011011010001111110011111111001101101000110011111100111111 e4a53f3fb5e3a5ef3f8fc7baa6ca3fd4da3f3fcda33f3f
UTF-8 筌뚮뿦泣ワ㎗洹κ묘壓꾩옃唯㏆쭔 1110011110101101100011001110101110011010101011101110101110111111101001101110011010110011101000111110001110000011101011111110001110001110100101111110011010110100101110011100111010111010111010111010110010011000111001011010001110010011111010101011111010101001111011001001100010000011111001011001010010101111111000111000111110000110111011001010110110010100 e7ad8ceb9aaeebbfa6e6b3a3e383afe38e97e6b4b9cebaebac98e5a393eabea9ec9883e594afe38f86ecad94
UHC 筌뚮뿦泣ワ㎗洹κ묘壓꾩옃唯㏆쭔 111011111010011110001100111010111001011110100110111010111110100010101011111011111010011110100011111010101011011110100101111010101011100110100110111001001110001010000100111011001001111010001111111010101110011010100111111011111010011110001100 efa78ceb97a6ebe8abefa7a3eab7a5eab9a6e4e284ec9e8feae6a7efa78c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)