To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 役?????轅??役?????音??役??? 100101101111000000111111001111110011111100111111001111111110011101110110001111110011111110010110111100000011111100111111001111110011111100111111100010011011100100111111001111111001011011110000001111110011111100111111 96f03f3f3f3f3fe7763f3f96f03f3f3f3f3f89b93f3f96f03f3f3f
EUC-JP 役?????轅??役?????音??役??? 110011001111001000111111001111110011111100111111001111111110110111010111001111110011111111001100111100100011111100111111001111110011111100111111101100101011101100111111001111111100110011110010001111110011111100111111 ccf23f3f3f3f3fedd73f3fccf23f3f3f3f3fb2bb3f3fccf23f3f3f
UTF-8 役대끇六쀥섧轅깅굹役대끇六쀥♤音붵뀅役대끇六 111001011011110110111001111010111000110010000000111010111000000110000111111011111010011110010001111011001000000010100101111011001000010010100111111010001011110110000101111010101011100110000101111010101011010110111001111001011011110110111001111010111000110010000000111010111000000110000111111011111010011110010001111011001000000010100101111000101001100110100100111010011001111110110011111010111011011010110101111010111000000010000101111001011011110110111001111010111000110010000000111010111000000110000111111011111010011110010001 e5bdb9eb8c80eb8187efa791ec80a5ec84a7e8bd85eab985eab5b9e5bdb9eb8c80eb8187efa791ec80a5e299a4e99fb3ebb6b5eb8085e5bdb9eb8c80eb8187efa791
UHC 役대끇六쀥섧轅깅굹役대끇六쀥♤音붵뀅役대끇六 1110011010110101101101001110101110000101101110111110101110111011100101111110010110111100101101011110101010111111101100011110101110000010100110001110011010110101101101001110101110000101101110111110101110111011100101111110010110100010101110111110101111100101100101001110001110000101100000011110011010110101101101001110101110000101101110111110101110111011 e6b5b4eb85bbebbb97e5bcb5eabfb1eb8298e6b5b4eb85bbebbb97e5a2bbebe594e38581e6b5b4eb85bbebbb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)