What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????≫?癲??伊 00111111001111110011111100111111100000011110001000111111111000011001111100111111001111111000100011001001 3f3f3f3f81e23fe19f3f3f88c9
EUC-JP ????≫?癲??伊 00111111001111110011111100111111101000101110010000111111111000101010000100111111001111111011000011001011 3f3f3f3fa2e43fe2a13f3fb0cb
UTF-8 捻뀁뼐璘≫씙癲싰퀡伊 111011111010011010100100111010111000000010000001111010111011110010010000111011111010011110101111111000101000100110101011111011001001010010011001111001111001100110110010111011001000101110110000111011011000000010100001111001001011110010001010 efa6a4eb8081ebbc90efa7afe289abec9499e799b2ec8bb0ed80a1e4bc8a
UHC 捻뀁뼐璘≫씙癲싰퀡伊 1110011011110111101100101110110010010110100110001110110011011110101000011110110110011101101011101110111110100110100110101110101010110011100101011110110010100101 e6f7b2ec9698ecdea1ed9daeefa69aeab395eca5

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
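
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) and the helper name encode_table are assumptions made for illustration, and Python's codecs may differ from this tool's tables for some characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# These codec choices are assumptions; the tool itself may use other tables.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("伊").items():
        print(label, binary, hexadecimal)
```

Each byte is printed as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.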