To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??U???????zh??U???????z 0011111100111111010101010011111100111111001111110011111100111111001111110011111101111010011010000011111100111111010101010011111100111111001111110011111100111111001111110011111101111010 3f3f553f3f3f3f3f3f3f7a683f3f553f3f3f3f3f3f3f7a
SJIS-WIN 偲璽U篠クナ鹿篠ス。zh偲璽U篠クナ鹿篠ス。z 100011101100001110001110101000110101010110001110110000101011100011000101100011101010110110001110110000101011110110100001011110100110100010001110110000111000111010100011010101011000111011000010101110001100010110001110101011011000111011000010101111011010000101111010 8ec38ea3558ec2b8c58ead8ec2bda17a688ec38ea3558ec2b8c58ead8ec2bda17a
EUC-JP 偲璽U篠クナ鹿篠ス。zh偲璽U篠クナ鹿篠ス。z 1011110011000101101111001010010101010101101111001100010010001110101110001000111011000101101111001010111110111100110001001000111010111101100011101010000101111010011010001011110011000101101111001010010101010101101111001100010010001110101110001000111011000101101111001010111110111100110001001000111010111101100011101010000101111010 bcc5bca555bcc48eb88ec5bcafbcc48ebd8ea17a68bcc5bca555bcc48eb88ec5bcafbcc48ebd8ea17a
UTF-8 偲璽U篠クナ鹿篠ス。zh偲璽U篠クナ鹿篠ス。z 1110010110000001101100101110011110010010101111010101010111100111101011111010000011101111101111011011100011101111101111101000010111101001101110011011111111100111101011111010000011101111101111011011110111101111101111011010000101111010011010001110010110000001101100101110011110010010101111010101010111100111101011111010000011101111101111011011100011101111101111101000010111101001101110011011111111100111101011111010000011101111101111011011110111101111101111011010000101111010 e581b2e792bd55e7afa0efbdb8efbe85e9b9bfe7afa0efbdbdefbda17a68e581b2e792bd55e7afa0efbdb8efbe85e9b9bfe7afa0efbdbdefbda17a
UHC ?璽U篠??鹿篠??zh?璽U篠??鹿篠??z 00111111110111111101111001010101111000011100011000111111001111111101011011100011111000011100011000111111001111110111101001101000001111111101111111011110010101011110000111000110001111110011111111010110111000111110000111000110001111110011111101111010 3fdfde55e1c63f3fd6e3e1c63f3f7a683fdfde55e1c63f3fd6e3e1c63f3f7a

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)