To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蟯??節??節ラМ???蟯??節??節ラМ???^ 1110010110110010001111110011111110010000110111110011111100111111100100001101111110000011100010011000010001001101001111110011111100111111111001011011001000111111001111111001000011011111001111110011111110010000110111111000001110001001100001000100110100111111001111110011111101011110 e5b23f3f90df3f3f90df8389844d3f3f3fe5b23f3f90df3f3f90df8389844d3f3f3f5e
EUC-JP 蟯??節??節ラМ???蟯??節??節ラМ???^ 1110101010110100001111110011111111000000111000010011111100111111110000001110000110100101111010011010011110101110001111110011111100111111111010101011010000111111001111111100000011100001001111110011111111000000111000011010010111101001101001111010111000111111001111110011111101011110 eab43f3fc0e13f3fc0e1a5e9a7ae3f3f3feab43f3fc0e13f3fc0e1a5e9a7ae3f3f3f5e
UTF-8 蟯얏㎛節뗨웶節ラМ略깍슨蟯얏㎛節뗨웶節ラМ略깍스^ 1110100010011111101011111110110010010110100011111110001110001110100110111110011110101111100000001110101110010111101010001110110010011011101101101110011110101111100000001110001110000011101010011101000010011100111011111010010110110110111010101011100110001101111011001000101010101000111010001001111110101111111011001001011010001111111000111000111010011011111001111010111110000000111010111001011110101000111011001001101110110110111001111010111110000000111000111000001110101001110100001001110011101111101001011011011011101010101110011000110111101100100010101010010001011110 e89fafec968fe38e9be7af80eb97a8ec9bb6e7af80e383a9d09cefa5b6eab98dec8aa8e89fafec968fe38e9be7af80eb97a8ec9bb6e7af80e383a9d09cefa5b6eab98dec8aa45e
UHC 蟯얏㎛節뗨웶節ラМ略깍슨蟯얏㎛節뗨웶節ラМ略깍스^ 11101001101010001011111011100110101001111010110111101111101111011000101111101000100111111000010011101111101111011010101111101001101011001010111011100101101100101011000111101111101111011011110011101001101010001011111011100110101001111010110111101111101111011000101111101000100111111000010011101111101111011010101111101001101011001010111011100101101100101011000111101111101111011011101001011110 e9a8bee6a7adefbd8be89f84efbdabe9acaee5b2b1efbdbce9a8bee6a7adefbd8be89f84efbdabe9acaee5b2b1efbdba5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)