To what bit string is a character (or string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 爾??????? 100011101010001000111111001111110011111100111111001111110011111100111111 8ea23f3f3f3f3f3f3f
EUC-JP 爾??????? 101111001010010000111111001111110011111100111111001111110011111100111111 bca43f3f3f3f3f3f3f
UTF-8 爾재렒펨렠재렒탓 111001111000100010111110111011001001111010101100111010111010000010010010111011011000111010101000111010111010000010100000111011001001111010101100111010111010000010010010111011011000001110010011 e788beec9eaceba092ed8ea8eba0a0ec9eaceba092ed8393
UHC 爾재렒펨렠재렒탓 11101100101100111100000011100111100011101010011111000110111010001000111010110001110000001110011110001110101001111100010110111111 ecb3c0e78ea7c6e88eb1c0e78ea7c5bf

SJIS-WIN, EUC-JP: Classic Japanese character encodings, mainly used on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table.
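
A table like the one above can be approximated with a few lines of Python. This is only a minimal sketch, not the converter's actual implementation. The helper name `encode_row` and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Unencodable characters are replaced with "?", which matches the 3f bytes shown in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, bit string, hex string) for text in codec.

    Hypothetical helper: characters the codec cannot represent are
    replaced with "?" (byte 0x3f) instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    shown = data.decode(codec)  # what survives the round trip
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return shown, bits, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        _, bits, hex_string = encode_row("爾", codec)
        print(label, bits, hex_string)
```

Each byte is formatted with `08b` so that leading zeros are kept, which is why every "?" contributes the eight bits 00111111 to the binary column.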