What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 厄????????ル????純??亦??誼 1001011011101111001111110011111100111111001111110011111100111111001111110011111110000011100010110011111100111111001111110011111110001111100000110011111100111111100101101001001000111111001111111000101101100010 96ef3f3f3f3f3f3f3f3f838b3f3f3f3f8f833f3f96923f3f8b62
EUC-JP 厄????????ル????純??亦??誼 1100110011110001001111110011111100111111001111110011111100111111001111110011111110100101111010110011111100111111001111110011111110111101111000110011111100111111110010111111001000111111001111111011010111000011 ccf13f3f3f3f3f3f3f3fa5eb3f3f3f3fbde33f3fcbf23f3fb5c3
UTF-8 厄닌됱뵦閱겉껎뫛曆ルㅏ麟뗥윜純볩폍亦끸궗誼 111001011000111010000100111010111000101110001100111010111001000010110001111010111011010110100110111010011001011010110001111010101011001010001001111010101011101110001110111010111010101110011011111011111010011010001011111000111000001110101011111000111000010110001111111011111010011110110011111010111001011110100101111011001001110010011100111001111011010010010100111010111011001110101001111011011000111110001101111001001011101010100110111010111000000110111000111010101011011010010111111010001010101010111100 e58e84eb8b8ceb90b1ebb5a6e996b1eab289eabb8eebab9befa68be383abe3858fefa7b3eb97a5ec9c9ce7b494ebb3a9ed8f8de4baa6eb81b8eab697e8aabc
UHC 厄닌됱뵦閱겉껎뫛曆ルㅏ麟뗥윜純볩폍亦끸궗誼 111001001111100010110100110100011000100111101100100101001010010111100110111100111011000011010001100000111110110110010001101110111110011010110111101010111110101110100100101111111110110011101000100010111110010110011111100111111110001011101101100100111110111110111100100110001110011010110010100001011110001010000010101011001110101111111110 e4f8b4d189ec94a5e6f3b0d183ed91bbe6b7abeba4bfece88be59f9fe2ed93efbc98e6b285e282acebfe

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
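
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation (which is not shown). The codec names are assumptions: SJIS-WIN is mapped to Python's "cp932" and UHC to "cp949", and characters that a charset cannot represent are replaced with "?" (0x3f), as in the rows above. Results for unencodable characters may differ from the page's output. The function name encode_table is hypothetical.

```python
# Map the labels used in the table to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {charset label: (binary bit string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("厄").items():
        print(label, bits, hexa)
```

For the single character 厄, this should give e58e84 for UTF-8, 96ef for SJIS-WIN and ccf1 for EUC-JP, matching the leading bytes of the corresponding rows, and 3f for ISO-8859-1, which has no such character.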