What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 褥?∥韋??膺???ル?猷??碎κ?佯 11100101111100010011111110000001011000011110100011101000001111110011111111100100010111100011111100111111001111111000001110001011001111111001011101010001001111110011111111100001111010101000001111001000001111111001100011010001 e5f13f8161e8e83f3fe45e3f3f3f838b3f97513f3fe1ea83c83f98d1
EUC-JP 褥?‖韋??膺???ル?猷??碎κ?佯 11101010111100110011111110100001110000101111000011101010001111110011111111100111101111110011111100111111001111111010010111101011001111111100110110110010001111110011111111100010111011001010011011001010001111111101000011010011 eaf33fa1c2f0ea3f3fe7bf3f3f3fa5eb3fcdb23f3fe2eca6ca3fd0d3
UTF-8 褥띕∥韋뤸꼷膺용겱曆ル봾猷녻첑碎κ땀佯 1110100010100100101001011110101110011101100101011110001010001000101001011110100110011111100010111110101110100100101110001110101010111100101101111110100010000110101110101110110010011010101010011110101010110010101100011110111110100110100010111110001110000011101010111110101110110100101111101110011110001100101101111110101110000101101110111110110010110010100100011110011110100010100011101100111010111010111010111001010110000000111001001011110110101111 e8a4a5eb9d95e288a5e99f8beba4b8eabcb7e886baec9aa9eab2b1efa68be383abebb4bee78cb7eb85bbecb291e7a28ecebaeb9580e4bdaf
UHC 褥띕∥韋뤸꼷膺용겱曆ル봾猷녻첑碎κ땀佯 1110100110110011101101101110101110100001101010111110101011011111100011111110011010000100100011111110101111101100101111111110101110000001101111011110011010110111101010111110101110010100100001011110101110100011100001101110100010101010100111101110000111101111101001011110101010110110101000011110010110111010 e9b3b6eba1abeadf8fe6848febecbfeb81bde6b7abeb9485eba386e8aa9ee1efa5eab6a1e5ba

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f).
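
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this tool's actual implementation: the helper name `encode_table` and the mapping from the table's charset labels to Python codec names (for example, UHC to `cp949`) are assumptions. Unencodable characters are replaced with "?", which appears to match the 3f bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset.

    Hypothetical helper for illustration only.
    """
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes b"?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Each byte becomes 8 binary digits, zero-padded on the left.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


sample = "\u03ba"  # Greek small letter kappa, one of the characters in the table
for label, bits, hex_string in encode_table(sample):
    print(label, bits, hex_string)
```

For the kappa character, the UTF-8 row of this sketch gives `ceba`, which also appears as a byte pair in the UTF-8 row of the table above.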