Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ???????? | 0011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f
SJIS-WIN | 也??泣??宜? | 1001011011100111001111110011111110001011100000110011111100111111100010110101100000111111 | 96e73f3f8b833f3f8b583f
EUC-JP | 也??泣??宜? | 1100110011101001001111110011111110110101111000110011111100111111101101011011100100111111 | cce93f3fb5e33f3fb5b93f
UTF-8 | 也㏆퐠泣녽펶宜밄 | 111001001011100110011111111000111000111110000110111011011001000010100000111001101011001110100011111010111000010110111101111011011000111010110110111001011010111010011100111010111011000010000100 | e4b99fe38f86ed90a0e6b3a3eb85bded8eb6e5ae9cebb084
UHC | 也㏆퐠泣녽펶宜밄 | 11100101101001011010011111101111101111011000100111101011111010001000011011101001101111001000011111101011111100011001001101000010 | e5a5a7efbd89ebe886e9bc87ebf19342

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f).
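
The conversion behind a table like this can be approximated in a few lines of Python. This is only a sketch, not the code this page runs: the function name encode_table is hypothetical, and the mapping of the page's charset labels to Python codec names (SJIS-WIN to cp932, EUC-JP to euc_jp, UHC to cp949) is an assumption. Unencodable characters are replaced with "?" via errors="replace", which mirrors the 3f bytes seen in the table.

```python
def encode_table(text, codecs):
    """Return (codec, round-tripped text, binary string, hex string) per codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    rows = []
    for codec in codecs:
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((codec, data.decode(codec), bits, data.hex()))
    return rows


if __name__ == "__main__":
    # Codec names are assumed equivalents of the labels used on the page.
    for row in encode_table("也", ["iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949"]):
        print(*row)
```

For the single character 也, the hex column should agree with the first bytes of the corresponding rows above (for example e4b99f for UTF-8 and 3f for ISO-8859-1).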