Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????^ | 00111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f5e |
| SJIS-WIN | 閭也噫閭也噫^ | 11101000100000111001011011100111100110101000000011101000100000111001011011100111100110101000000001011110 | e88396e79a80e88396e79a805e |
| EUC-JP | 閭也噫閭也噫^ | 11101111111000111100110011101001110100111110000011101111111000111100110011101001110100111110000001011110 | efe3cce9d3e0efe3cce9d3e05e |
| UTF-8 | 閭也噫閭也噫^ | 11101001100101101010110111100100101110011001111111100101100110011010101111101001100101101010110111100100101110011001111111100101100110011010101101011110 | e996ade4b99fe599abe996ade4b99fe599ab5e |
| UHC | 閭也噫閭也噫^ | 11010101111011111110010110100101111111011110110111010101111011111110010110100101111111011110110101011110 | d5efe5a5fdedd5efe5a5fded5e |

SJIS-Win, EUC-JP: Legacy Japanese character sets, used mainly on Windows (SJIS-Win, i.e. CP932) and on UNIX (EUC-JP).
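
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's own implementation (which is not shown here). The mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as is the choice to replace unencodable characters with `?`, which mirrors the ISO-8859-1 row. The names `CODECS` and `encode_row` are hypothetical.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (hex string, bit string) for text encoded with the given codec.

    Characters the codec cannot represent are replaced with '?',
    which is an assumption modeled on the ISO-8859-1 row of the table.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return data.hex(), bits


if __name__ == "__main__":
    sample = "\u95ad\u4e5f\u566b^"  # the three kanji from the table, then '^'
    for label, codec in CODECS.items():
        hex_str, bits = encode_row(sample, codec)
        print(label, bits, hex_str)
```

Python's codecs may differ from the page's converter for some vendor-specific characters, so results for other inputs should be treated as indicative rather than authoritative.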