To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 偲璽偲嫉篠、トセトス、偲璽偲嫉篠、トセトス、B 100011101100001110001110101000111000111011000011100011101011100110001110110000101010010011000100101111101100010010111101101001001000111011000011100011101010001110001110110000111000111010111001100011101100001010100100110001001011111011000100101111011010010001000010 8ec38ea38ec38eb98ec2a4c4bec4bda48ec38ea38ec38eb98ec2a4c4bec4bda442
EUC-JP 偲璽偲嫉篠、トセトス、偲璽偲嫉篠、トセトス、B 101111001100010110111100101001011011110011000101101111001011101110111100110001001000111010100100100011101100010010001110101111101000111011000100100011101011110110001110101001001011110011000101101111001010010110111100110001011011110010111011101111001100010010001110101001001000111011000100100011101011111010001110110001001000111010111101100011101010010001000010 bcc5bca5bcc5bcbbbcc48ea48ec48ebe8ec48ebd8ea4bcc5bca5bcc5bcbbbcc48ea48ec48ebe8ec48ebd8ea442
UTF-8 偲璽偲嫉篠、トセトス、偲璽偲嫉篠、トセトス、B 11100101100000011011001011100111100100101011110111100101100000011011001011100101101010111000100111100111101011111010000011101111101111011010010011101111101111101000010011101111101111011011111011101111101111101000010011101111101111011011110111101111101111011010010011100101100000011011001011100111100100101011110111100101100000011011001011100101101010111000100111100111101011111010000011101111101111011010010011101111101111101000010011101111101111011011111011101111101111101000010011101111101111011011110111101111101111011010010001000010 e581b2e792bde581b2e5ab89e7afa0efbda4efbe84efbdbeefbe84efbdbdefbda4e581b2e792bde581b2e5ab89e7afa0efbda4efbe84efbdbeefbe84efbdbdefbda442
UHC ?璽?嫉篠???????璽?嫉篠??????B 0011111111011111110111100011111111110010111011001110000111000110001111110011111100111111001111110011111100111111001111111101111111011110001111111111001011101100111000011100011000111111001111110011111100111111001111110011111101000010 3fdfde3ff2ece1c63f3f3f3f3f3f3fdfde3ff2ece1c63f3f3f3f3f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)