To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 甑?肄?兵池???甑?肄?兵池???^ 100011011001100100111111111000111110010100111111100101011011101010010010011100100011111100111111001111111000110110011001001111111110001111100101001111111001010110111010100100100111001000111111001111110011111101011110 8d993fe3e53f95ba92723f3f3f8d993fe3e53f95ba92723f3f3f5e
EUC-JP 甑?肄?兵池?橒?甑?肄?兵池?橒?^ 10111001111110010011111111100110111001110011111111001010101111001100001111010011001111111000111111000101101011010011111110111001111110010011111111100110111001110011111111001010101111001100001111010011001111111000111111000101101011010011111101011110 b9f93fe6e73fcabcc3d33f8fc5ad3fb9f93fe6e73fcabcc3d33f8fc5ad3f5e
UTF-8 甑렏肄펠兵池렞橒렡甑렏肄펠兵池렞橒렣^ 11100111100101001001000111101011101000001000111111101000100000101000010011101101100011101010000011100101100001011011010111100110101100011010000011101011101000001001111011100110101010011001001011101011101000001010000111100111100101001001000111101011101000001000111111101000100000101000010011101101100011101010000011100101100001011011010111100110101100011010000011101011101000001001111011100110101010011001001011101011101000001010001101011110 e79491eba08fe88284ed8ea0e585b5e6b1a0eba09ee6a992eba0a1e79491eba08fe88284ed8ea0e585b5e6b1a0eba09ee6a992eba0a35e
UHC 甑렏肄펠兵池렞橒렡甑렏肄펠兵池렞橒렣^ 11110001111101111000111010100101111011001011110111000110111001111101110010110010111100101010111010001110101011111110100111111000100011101011001011110001111101111000111010100101111011001011110111000110111001111101110010110010111100101010111010001110101011111110100111111000100011101011010001011110 f1f78ea5ecbdc6e7dcb2f2ae8eafe9f88eb2f1f78ea5ecbdc6e7dcb2f2ae8eafe9f88eb45e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)