To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 壤??鎰??魏??永??裕?壤??鎰??韋?? 10011010110111110011111100111111111010000100110000111111001111111110100110110000001111110011111110001001011010010011111100111111100101110101010000111111100110101101111100111111001111111110100001001100001111110011111111101000111010000011111100111111 9adf3f3fe84c3f3fe9b03f3f89693f3f97543f9adf3f3fe84c3f3fe8e83f3f
EUC-JP 壤??鎰??魏??永??裕?壤??鎰??韋?? 11010100111000010011111100111111111011111010110100111111001111111111001010110010001111110011111110110001110010100011111100111111110011011011010100111111110101001110000100111111001111111110111110101101001111110011111111110000111010100011111100111111 d4e13f3fefad3f3ff2b23f3fb1ca3f3fcdb53fd4e13f3fefad3f3ff0ea3f3f
UTF-8 壤깆쥜鎰쏁독魏됲뭲永띕굙裕꿆壤깆쥜鎰쏁독韋블닂 111001011010001110100100111010101011100110000110111011001010010110011100111010011000111010110000111011001000111110000001111010111000111110000101111010011010110110001111111010111001000010110010111010111010110110110010111001101011000010111000111010111001110110010101111010101011010110011001111010001010001110010101111010101011111110000110111001011010001110100100111010101011100110000110111011001010010110011100111010011000111010110000111011001000111110000001111010111000111110000101111010011001111110001011111010111011100010010100111010111000101110000010 e5a3a4eab986eca59ce98eb0ec8f81eb8f85e9ad8feb90b2ebadb2e6b0b8eb9d95eab599e8a395eabf86e5a3a4eab986eca59ce98eb0ec8f81eb8f85e99f8bebb894eb8b82
UHC 壤깆쥜鎰쏁독魏됲뭲永띕굙裕꿆壤깆쥜鎰쏁독韋블닂 11100101101111011011000111101100101000101001000111101100111100001001101111100111101101011011011011101010111000001000100111101101100100101000000111100111101101011011011011101011100000101000000111101011101011101000010101000111111001011011110110110001111011001010001010010001111011001111000010011011111001111011010110110110111010101101111110111010111011011000100010001011 e5bdb1eca291ecf09be7b5b6eae089ed9281e7b5b6eb8281ebae8547e5bdb1eca291ecf09be7b5b6eadfbaed888b

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
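
The conversion shown in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the code behind this page: the function name to_bitstrings and the mapping from the charset labels above to Python codec names (cp932 for SJIS-Win, euc_jp for EUC-JP, cp949 for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Hypothetical mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bitstrings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "あ"  # any single character or short string
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bitstrings(sample, codec)
        print(label, binary, hexadecimal)
```

For example, to_bitstrings("A", "utf-8") returns ("01000001", "41"), since "A" is the single byte 0x41 in UTF-8.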