To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????i??????iB 001111110011111100111111001111110011111100111111011010010011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f693f3f3f3f3f3f6942
SJIS-WIN 歎測旦但坦袖i歎測旦但坦袖iB 100100100101011010010001101010101001001001010101100100100100000110010010010100101001000110110011011010011001001001010110100100011010101010010010010101011001001001000001100100100101001010010001101100110110100101000010 925691aa92559241925291b369925691aa92559241925291b36942
EUC-JP 歎測旦但坦袖i歎測旦但坦袖iB 110000111011011111000010101011001100001110110110110000111010001011000011101100111100001010110101011010011100001110110111110000101010110011000011101101101100001110100010110000111011001111000010101101010110100101000010 c3b7c2acc3b6c3a2c3b3c2b569c3b7c2acc3b6c3a2c3b3c2b56942
UTF-8 歎測旦但坦袖i歎測旦但坦袖iB 111001101010110110001110111001101011100010101100111001101001011110100110111001001011110110000110111001011001110110100110111010001010001010010110011010011110011010101101100011101110011010111000101011001110011010010111101001101110010010111101100001101110010110011101101001101110100010100010100101100110100101000010 e6ad8ee6b8ace697a6e4bd86e59da6e8a29669e6ad8ee6b8ace697a6e4bd86e59da6e8a2966942
UHC 歎測旦但坦袖i歎測旦但坦袖iB 111101111010011111110110101101001101001110101001110100111010001111110111101001001110001011000000011010011111011110100111111101101011010011010011101010011101001110100011111101111010010011100010110000000110100101000010 f7a7f6b4d3a9d3a3f7a4e2c069f7a7f6b4d3a9d3a3f7a4e2c06942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)