What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????U 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f55
SJIS-WIN シナ痔柴偲失シナ痔柴偲七シナ痔柴偲軸シナ痔柴偲蒔U 1011110011000101100011101010010010001110110001001000111011000011100011101011100010111100110001011000111010100100100011101100010010001110110000111000111010110101101111001100010110001110101001001000111011000100100011101100001110001110101100101011110011000101100011101010010010001110110001001000111011000011100011101010101001010101 bcc58ea48ec48ec38eb8bcc58ea48ec48ec38eb5bcc58ea48ec48ec38eb2bcc58ea48ec48ec38eaa55
EUC-JP シナ痔柴偲失シナ痔柴偲七シナ痔柴偲軸シナ痔柴偲蒔U 10001110101111001000111011000101101111001010011010111100110001101011110011000101101111001011101010001110101111001000111011000101101111001010011010111100110001101011110011000101101111001011011110001110101111001000111011000101101111001010011010111100110001101011110011000101101111001011010010001110101111001000111011000101101111001010011010111100110001101011110011000101101111001010110001010101 8ebc8ec5bca6bcc6bcc5bcba8ebc8ec5bca6bcc6bcc5bcb78ebc8ec5bca6bcc6bcc5bcb48ebc8ec5bca6bcc6bcc5bcac55
UTF-8 シナ痔柴偲失シナ痔柴偲七シナ痔柴偲軸シナ痔柴偲蒔U 11101111101111011011110011101111101111101000010111100111100101111001010011100110100111111011010011100101100000011011001011100101101001001011000111101111101111011011110011101111101111101000010111100111100101111001010011100110100111111011010011100101100000011011001011100100101110001000001111101111101111011011110011101111101111101000010111100111100101111001010011100110100111111011010011100101100000011011001011101000101110111011100011101111101111011011110011101111101111101000010111100111100101111001010011100110100111111011010011100101100000011011001011101000100100101001010001010101 efbdbcefbe85e79794e69fb4e581b2e5a4b1efbdbcefbe85e79794e69fb4e581b2e4b883efbdbcefbe85e79794e69fb4e581b2e8bbb8efbdbcefbe85e79794e69fb4e581b2e8929455
UHC ??痔柴?失??痔柴?七??痔柴?軸??痔柴?蒔U 00111111001111111111011011000000111000111100001100111111111000111111011100111111001111111111011011000000111000111100001100111111111101101101001000111111001111111111011011000000111000111100001100111111111101011110111000111111001111111111011011000000111000111100001100111111111000111100100001010101 3f3ff6c0e3c33fe3f73f3ff6c0e3c33ff6d23f3ff6c0e3c33ff5ee3f3ff6c0e3c33fe3c855

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
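
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The codec names are assumptions: `cp932` stands in for SJIS-WIN, `cp949` for UHC, and `errors="replace"` is used to imitate the `3f` ("?") bytes the table shows for characters a charset cannot represent. The function name `encode_table` is hypothetical.

```python
# Map the table's charset labels to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Unencodable characters become "?" (0x3f), as in the table above.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("痔U"):
        print(label, bits, hex_string)
```

For the input "痔U", the hexadecimal column should agree with the corresponding byte pairs in the table: `e79794` (UTF-8), `8ea4` (SJIS-WIN), and `bca6` (EUC-JP) for 痔, followed by `55` for U in every charset.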