To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???膺??違??永??裕??議?????膺?? 001111110011111100111111111001000101111000111111001111111000100011100001001111110011111110001001011010010011111100111111100101110101010000111111001111111000101101100011001111110011111100111111001111110011111111100100010111100011111100111111 3f3f3fe45e3f3f88e13f3f89693f3f97543f3f8b633f3f3f3f3fe45e3f3f
EUC-JP ???膺??違??永??裕??議?????膺?? 001111110011111100111111111001111011111100111111001111111011000011100011001111110011111110110001110010100011111100111111110011011011010100111111001111111011010111000100001111110011111100111111001111110011111111100111101111110011111100111111 3f3f3fe7bf3f3fb0e33f3fb1ca3f3fcdb53f3fb5c43f3f3f3f3fe7bf3f3f
UTF-8 捻뚭였膺곭솈違먯뒪永띕굚裕덂쉽議얇뀅捻뚭였膺곭솈 111011111010011010100100111010111001101010101101111011001001100010000000111010001000011010111010111010101011001110101101111011001000011010001000111010011000000110010101111010111010100010101111111010111001001010101010111001101011000010111000111010111001110110010101111010101011010110011010111010001010001110010101111010111000110110000010111011001000100110111101111010001010110110110000111011001001011010000111111010111000000010000101111011111010011010100100111010111001101010101101111011001001100010000000111010001000011010111010111010101011001110101101111011001000011010001000 efa6a4eb9aadec9880e886baeab3adec8688e98195eba8afeb92aae6b0b8eb9d95eab59ae8a395eb8d82ec89bde8adb0ec9687eb8085efa6a4eb9aadec9880e886baeab3adec8688
UHC 捻뚭였膺곭솈違먯뒪永띕굚裕덂쉽議얇뀅捻뚭였膺곭솈 111001101111011110001100111010101011111110110100111010111110110010000001111001111001100110001100111010101101111010010000111011001000101010100100111001111011010110110110111010111000001010000010111010111010111010001000111001011011110110110001111011001010000110111110111000111000010110000001111001101111011110001100111010101011111110110100111010111110110010000001111001111001100110001100 e6f78ceabfb4ebec81e7998ceade90ec8aa4e7b5b6eb8282ebae88e5bdb1eca1bee38581e6f78ceabfb4ebec81e7998c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)