To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????}v????????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN ジシミシムシスショシイ}vジシミシムシスショシイ}vB 1011110011011110101111001101000010111100110100011011110010111101101111001010111010111100101100100111110101110110101111001101111010111100110100001011110011010001101111001011110110111100101011101011110010110010011111010111011001000010 bcdebcd0bcd1bcbdbcaebcb27d76bcdebcd0bcd1bcbdbcaebcb27d7642
EUC-JP ジシミシムシスショシイ}vジシミシムシスショシイ}vB 1000111010111100100011101101111010001110101111001000111011010000100011101011110010001110110100011000111010111100100011101011110110001110101111001000111010101110100011101011110010001110101100100111110101110110100011101011110010001110110111101000111010111100100011101101000010001110101111001000111011010001100011101011110010001110101111011000111010111100100011101010111010001110101111001000111010110010011111010111011001000010 8ebc8ede8ebc8ed08ebc8ed18ebc8ebd8ebc8eae8ebc8eb27d768ebc8ede8ebc8ed08ebc8ed18ebc8ebd8ebc8eae8ebc8eb27d7642
UTF-8 ジシミシムシスショシイ}vジシミシムシスショシイ}vB 1110111110111101101111001110111110111110100111101110111110111101101111001110111110111110100100001110111110111101101111001110111110111110100100011110111110111101101111001110111110111101101111011110111110111101101111001110111110111101101011101110111110111101101111001110111110111101101100100111110101110110111011111011110110111100111011111011111010011110111011111011110110111100111011111011111010010000111011111011110110111100111011111011111010010001111011111011110110111100111011111011110110111101111011111011110110111100111011111011110110101110111011111011110110111100111011111011110110110010011111010111011001000010 efbdbcefbe9eefbdbcefbe90efbdbcefbe91efbdbcefbdbdefbdbcefbdaeefbdbcefbdb27d76efbdbcefbe9eefbdbcefbe90efbdbcefbe91efbdbcefbdbdefbdbcefbdaeefbdbcefbdb27d7642
UHC ????????????}v????????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f3f3f3f7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)