To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????b?????????bB 001111110011111100111111001111110011111100111111001111110011111100111111011000100011111100111111001111110011111100111111001111110011111100111111001111110110001001000010 3f3f3f3f3f3f3f3f3f623f3f3f3f3f3f3f3f3f6242
SJIS-WIN 永?????循??b永?????循??bB 10001001011010010011111100111111001111110011111100111111100011110111101000111111001111110110001010001001011010010011111100111111001111110011111100111111100011110111101000111111001111110110001001000010 89693f3f3f3f3f8f7a3f3f6289693f3f3f3f3f8f7a3f3f6242
EUC-JP 永?????循??b永?????循??bB 10110001110010100011111100111111001111110011111100111111101111011101101100111111001111110110001010110001110010100011111100111111001111110011111100111111101111011101101100111111001111110110001001000010 b1ca3f3f3f3f3fbddb3f3f62b1ca3f3f3f3f3fbddb3f3f6242
UTF-8 永띔퇌隣멨넼循륁뎾b永띔퇌隣멨넼循륁뎾bB 111001101011000010111000111010111001110110010100111011011000011110001100111011111010011110110001111010111010100110101000111010111000010010111100111001011011111010101010111010111010010110000001111010111000111010111110011000101110011010110000101110001110101110011101100101001110110110000111100011001110111110100111101100011110101110101001101010001110101110000100101111001110010110111110101010101110101110100101100000011110101110001110101111100110001001000010 e6b0b8eb9d94ed878cefa7b1eba9a8eb84bce5beaaeba581eb8ebe62e6b0b8eb9d94ed878cefa7b1eba9a8eb84bce5beaaeba581eb8ebe6242
UHC 永띔퇌隣멨넼循륁뎾b永띔퇌隣멨넼循륁뎾bB 111001111011010110110110111010101011011110011101111011001110010010111000111001011000011010110110111000101110000010001111111011001000100110010001011000101110011110110101101101101110101010110111100111011110110011100100101110001110010110000110101101101110001011100000100011111110110010001001100100010110001001000010 e7b5b6eab79dece4b8e586b6e2e08fec899162e7b5b6eab79dece4b8e586b6e2e08fec89916242

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)