To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????un}????????un{^ 0011111100111111001111110011111100111111001111110011111100111111011101010110111001111101001111110011111100111111001111110011111100111111001111110011111101110101011011100111101101011110 3f3f3f3f3f3f3f3f756e7d3f3f3f3f3f3f3f3f756e7b5e
SJIS-WIN 鱆軸鴆竺鱆軸炅漆un}鱆軸鴆竺鱆軸炅漆un{^ 111010011110000110001110101100101110100111101111100011101011000111101001111000011000111010110010111110110101000110001110101111010111010101101110011111011110100111100001100011101011001011101001111011111000111010110001111010011110000110001110101100101111101101010001100011101011110101110101011011100111101101011110 e9e18eb2e9ef8eb1e9e18eb2fb518ebd756e7de9e18eb2e9ef8eb1e9e18eb2fb518ebd756e7b5e
EUC-JP 鱆軸鴆竺鱆軸炅漆un}鱆軸鴆竺鱆軸炅漆un{^ 1111001011100011101111001011010011110010111100011011110010110011111100101110001110111100101101001000111111001001110010101011110010111111011101010110111001111101111100101110001110111100101101001111001011110001101111001011001111110010111000111011110010110100100011111100100111001010101111001011111101110101011011100111101101011110 f2e3bcb4f2f1bcb3f2e3bcb48fc9cabcbf756e7df2e3bcb4f2f1bcb3f2e3bcb48fc9cabcbf756e7b5e
UTF-8 鱆軸鴆竺鱆軸炅漆un}鱆軸鴆竺鱆軸炅漆un{^ 11101001101100011000011011101000101110111011100011101001101101001000011011100111101010111011101011101001101100011000011011101000101110111011100011100111100000101000010111100110101111001000011001110101011011100111110111101001101100011000011011101000101110111011100011101001101101001000011011100111101010111011101011101001101100011000011011101000101110111011100011100111100000101000010111100110101111001000011001110101011011100111101101011110 e9b186e8bbb8e9b486e7abbae9b186e8bbb8e78285e6bc86756e7de9b186e8bbb8e9b486e7abbae9b186e8bbb8e78285e6bc86756e7b5e
UHC ?軸?竺?軸炅漆un}?軸?竺?軸炅漆un{^ 001111111111010111101110001111111111010111100111001111111111010111101110110011001101110111110110110101000111010101101110011111010011111111110101111011100011111111110101111001110011111111110101111011101100110011011101111101101101010001110101011011100111101101011110 3ff5ee3ff5e73ff5eeccddf6d4756e7d3ff5ee3ff5e73ff5eeccddf6d4756e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)