To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???膺??乙≫?永??膺??乙щ?永 001111110011111100111111111001000101111000111111001111111000100110110011100000011110001000111111100010010110100100111111001111111110010001011110001111110011111110001001101100111000010010001011001111111000100101101001 3f3f3fe45e3f3f89b381e23f89693f3fe45e3f3f89b3848b3f8969
EUC-JP 倻??膺??乙≫?永??膺??乙щ?永 1000111110110001111101100011111100111111111001111011111100111111001111111011001010110101101000101110010000111111101100011100101000111111001111111110011110111111001111110011111110110010101101011010011111101011001111111011000111001010 8fb1f63f3fe7bf3f3fb2b5a2e43fb1ca3f3fe7bf3f3fb2b5a7eb3fb1ca
UTF-8 倻귣떩膺곦벧乙≫닍永띕벙膺곦벧乙щ씩永 1110010110000000101110111110101010110111101000111110101110010110101010011110100010000110101110101110101010110011101001101110101110110010101001111110010010111001100110011110001010001001101010111110101110001011100011011110011010110000101110001110101110011101100101011110101110110010100110011110100010000110101110101110101010110011101001101110101110110010101001111110010010111001100110011101000110001001111011001001010010101001111001101011000010111000 e580bbeab7a3eb96a9e886baeab3a6ebb2a7e4b999e289abeb8b8de6b0b8eb9d95ebb299e886baeab3a6ebb2a7e4b999d189ec94a9e6b0b8
UHC 倻귣떩膺곦벧乙≫닍永띕벙膺곦벧乙щ씩永 1110010110100110100000101110101110001011101110111110101111101100100000011110010010111010101001101110101111100000101000011110110110001000100100111110011110110101101101101110101110111010101000011110101111101100100000011110010010111010101001101110101111100000101011001110101110111110101111111110011110110101 e5a682eb8bbbebec81e4baa6ebe0a1ed8893e7b5b6ebbaa1ebec81e4baa6ebe0acebbebfe7b5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)