To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊?????烏??肄??酉??熬??逾 00111111001111110011111111100010100001100011111100111111001111110011111100111111100010010100011100111111001111111110001111100101001111110011111110010011110100010011111100111111111000001001001000111111001111111110011110100101 3f3f3fe2863f3f3f3f3f89473f3fe3e53f3f93d13f3fe0923f3fe7a5
EUC-JP ???竊??洧??烏??肄??酉??熬??逾 001111110011111100111111111000111110011000111111001111111000111111000111101101000011111100111111101100011010100000111111001111111110011011100111001111110011111111000110110100110011111100111111110111111111001000111111001111111110111010100111 3f3f3fe3e63f3f8fc7b43f3fb1a83f3fe6e73f3fc6d33f3fdff23f3feea7
UTF-8 捻뀁뮆竊섉꼷洧댄뒅烏겸뫕肄잍콨酉멸석熬곻퐣逾 111011111010011010100100111010111000000010000001111010111010111010000110111001111010101110001010111011001000010010001001111010101011110010110111111001101011010010100111111010111000110010000100111010111001001010000101111001111000001110001111111010101011001010111000111010111010101110010101111010001000001010000100111011001001111010001101111011001011110110101000111010011000010110001001111010111010100110111000111011001000010010011101111001111000011010101100111010101011001110111011111011011001000010100011111010011000000010111110 efa6a4eb8081ebae86e7ab8aec8489eabcb7e6b4a7eb8c84eb9285e7838feab2b8ebab95e88284ec9e8decbda8e98589eba9b8ec849de786aceab3bbed90a3e980be
UHC 捻뀁뮆竊섉꼷洧댄뒅烏겸뫕肄잍콨酉멸석熬곻퐣逾 1110011011110111101100101110110010010010100101011110111110111100100110001110011010000100100011111110101011111011101101001110110110001010100000111110100010100001101100001110001010010001101101111110110010111101100111111110011010110001100111011110101110110111101110001110101010111100101011101110100010100010100000011110111110111101100011001110101110110101 e6f7b2ec9295efbc98e6848feafbb4ed8a83e8a1b0e291b7ecbd9fe6b19debb7b8eabcaee8a281efbd8cebb5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)