To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????[????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f
SJIS-WIN 短臓蔵辿孫多短贈族竪続捉[短贈測脱賊 1001001001011010100100011001111110010001101000001001001001001000100100011011011110010001101111011001001001011010100100011010000110010001101100001001001001000111100100011011000110010001101010000101101110010010010110101001000110100001100100011010101010010010010001011001000110101111 925a919f91a0924891b791bd925a91a191b0924791b191a85b925a91a191aa924591af
EUC-JP 短臓蔵辿孫多短贈族竪続捉[短贈測脱賊 1100001110111011110000101010000111000010101000101100001110101001110000101011100111000010101111111100001110111011110000101010001111000010101100101100001110101000110000101011001111000010101010100101101111000011101110111100001010100011110000101010110011000011101001101100001010110001 c3bbc2a1c2a2c3a9c2b9c2bfc3bbc2a3c2b2c3a8c2b3c2aa5bc3bbc2a3c2acc3a6c2b1
UTF-8 短臓蔵辿孫多短贈族竪続捉[短贈測脱賊 11100111100111111010110111101000100001111001001111101000100101001011010111101000101111101011111111100101101011011010101111100101101001001001101011100111100111111010110111101000101101001000100011100110100101111000111111100111101010111010101011100111101101101001101011100110100011011000100101011011111001111001111110101101111010001011010010001000111001101011100010101100111010001000010010110001111010001011001110001010 e79fade88793e894b5e8bebfe5adabe5a49ae79fade8b488e6978fe7abaae7b69ae68d895be79fade8b488e6b8ace884b1e8b38a
UHC 短???孫多短贈族竪?捉[短贈測?賊 110100111010110100111111001111110011111111100001110111011101001011111101110100111010110111110001111111001111000011101001111000101011010100111111111100111011010101011011110100111010110111110001111111001111011010110100001111111110111011100100 d3ad3f3f3fe1ddd2fdd3adf1fcf0e9e2b53ff3b55bd3adf1fcf6b43feee4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)