To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 梧??汚??梧??釗??節??節ч?節?? 1000110011100110001111110011111110001001100110000011111100111111100011001110011000111111001111111111101110111011001111110011111110010000110111110011111100111111100100001101111110000100100010010011111110010000110111110011111100111111 8ce63f3f89983f3f8ce63f3ffbbb3f3f90df3f3f90df84893f90df3f3f
EUC-JP 梧??汚??梧??釗??節??節ч?節?? 101110001110100000111111001111111011000111111000001111110011111110111000111010000011111100111111100011111110001110100110001111110011111111000000111000010011111100111111110000001110000110100111111010010011111111000000111000010011111100111111 b8e83f3fb1f83f3fb8e83f3f8fe3a63f3fc0e13f3fc0e1a7e93fc0e13f3f
UTF-8 梧귨쉠汚억슬梧잌쨰釗녘쐢節띈쾫節ч뼹節김쐥 1110011010100010101001111110101010110111101010001110110010001001101000001110011010110001100110101110110010010110101101011110110010001010101011001110011010100010101001111110110010011110100011001110110010101000101100001110100110000111100101111110101110000101100110001110110010010000101000101110011110101111100000001110101110011101100010001110110010111110101010111110011110101111100000001101000110000111111010111011110010111001111001111010111110000000111010101011100110000000111011001001000010100101 e6a2a7eab7a8ec89a0e6b19aec96b5ec8aace6a2a7ec9e8ceca8b0e98797eb8598ec90a2e7af80eb9d88ecbeabe7af80d187ebbcb9e7af80eab980ec90a5
UHC 梧귨쉠汚억슬梧잌쨰釗녘쐢節띈쾫節ч뼹節김쐥 111001111111110010000010111011111011110110101010111001111111110110111110111011111011110110111101111001111111110010011111111001011010010010001010111000011111001010110011111010001001110010001000111011111011110110110110111010001011001010000010111011111011110110101100111010011001011010111100111011111011110110110001111010001001110010001010 e7fc82efbdaae7fdbeefbdbde7fc9fe5a48ae1f2b3e89c88efbdb6e8b282efbdace996bcefbdb1e89c8a

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
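
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the helper name to_bit_strings and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) are assumptions, and Python's codecs may differ in small details from the converter used here. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for each unencodable character.
    raw = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    # Print one row per charset for a sample character.
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("あ", codec)
        print(label, binary, hexadecimal)
```

For example, to_bit_strings("あ", "utf-8") returns the hexadecimal string e38182, while the same character under "latin-1" yields 3f because ISO-8859-1 has no hiragana.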