To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????b???????????? 00111111001111110011111100111111001111110011111101100010001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f623f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN シナ痔柴赦鴫bシナ痔柴赦鴫シナ痔柴謝自 10111100110001011000111010100100100011101100010010001110110011011000111010110000011000101011110011000101100011101010010010001110110001001000111011001101100011101011000010111100110001011000111010100100100011101100010010001110110100111000111010101001 bcc58ea48ec48ecd8eb062bcc58ea48ec48ecd8eb0bcc58ea48ec48ed38ea9
EUC-JP シナ痔柴赦鴫bシナ痔柴赦鴫シナ痔柴謝自 10001110101111001000111011000101101111001010011010111100110001101011110011001111101111001011001001100010100011101011110010001110110001011011110010100110101111001100011010111100110011111011110010110010100011101011110010001110110001011011110010100110101111001100011010111100110101011011110010101011 8ebc8ec5bca6bcc6bccfbcb2628ebc8ec5bca6bcc6bccfbcb28ebc8ec5bca6bcc6bcd5bcab
UTF-8 シナ痔柴赦鴫bシナ痔柴赦鴫シナ痔柴謝自 11101111101111011011110011101111101111101000010111100111100101111001010011100110100111111011010011101000101101011010011011101001101101001010101101100010111011111011110110111100111011111011111010000101111001111001011110010100111001101001111110110100111010001011010110100110111010011011010010101011111011111011110110111100111011111011111010000101111001111001011110010100111001101001111110110100111010001010110010011101111010001000011110101010 efbdbcefbe85e79794e69fb4e8b5a6e9b4ab62efbdbcefbe85e79794e69fb4e8b5a6e9b4abefbdbcefbe85e79794e69fb4e8ac9de887aa
UHC ??痔柴赦?b??痔柴赦???痔柴謝自 0011111100111111111101101100000011100011110000111101111011110101001111110110001000111111001111111111011011000000111000111100001111011110111101010011111100111111001111111111011011000000111000111100001111011110111100111110110110111011 3f3ff6c0e3c3def53f623f3ff6c0e3c3def53f3f3ff6c0e3c3def3edbb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)