To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????}????{^ 0011111100111111001111110011111101111101001111110011111100111111001111110111101101011110 3f3f3f3f7d3f3f3f3f7b5e
SJIS-WIN 歎息歎造}歎息歎造{^ 10010010010101101001000110100111100100100101011010010001101000100111110110010010010101101001000110100111100100100101011010010001101000100111101101011110 925691a7925691a27d925691a7925691a27b5e
EUC-JP 歎息歎造}歎息歎造{^ 11000011101101111100001010101001110000111011011111000010101001000111110111000011101101111100001010101001110000111011011111000010101001000111101101011110 c3b7c2a9c3b7c2a47dc3b7c2a9c3b7c2a47b5e
UTF-8 歎息歎造}歎息歎造{^ 111001101010110110001110111001101000000110101111111001101010110110001110111010011000000010100000011111011110011010101101100011101110011010000001101011111110011010101101100011101110100110000000101000000111101101011110 e6ad8ee681afe6ad8ee980a07de6ad8ee681afe6ad8ee980a07b5e
UHC 歎息歎造}歎息歎造{^ 11110111101001111110001111010011111101111010011111110000111000110111110111110111101001111110001111010011111101111010011111110000111000110111101101011110 f7a7e3d3f7a7f0e37df7a7e3d3f7a7f0e37b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)