What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嚥〓?二у?怨?.嚥 1001101010001011100000011010110000111111100100111111000110000100100001010011111110001001100001010011111110000001010001001001101010001011 9a8b81ac3f93f184853f89853f81449a8b
EUC-JP 嚥〓?二у?怨?.嚥 1101001111101011101000101010111000111111110001101111001110100111111001010011111110110001111001010011111110100001101001011101001111101011 d3eba2ae3fc6f3a7e53fb1e53fa1a5d3eb
UTF-8 嚥〓쉴二у슫怨룹.嚥 1110010110011010101001011110001110000000100100111110110010001001101101001110010010111010100011001101000110000011111011001000101010101011111001101000000010101000111010111010001110111001111011111011110010001110111001011001101010100101 e59aa5e38093ec89b4e4ba8cd183ec8aabe680a8eba3b9efbc8ee59aa5
UHC 嚥〓쉴二у슫怨룹.嚥 1110011010111111101000011110101110111101101011111110110010100011101011001110010110011010101101001110101010110011101101111110110010100011101011101110011010111111 e6bfa1ebbdafeca3ace59ab4eab3b7eca3aee6bf

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
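
The conversion in the table can be approximated with a short Python sketch. This is an illustration, not the code behind this page. The mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) and the helper names encode_to_strings and convert are assumptions made for this example. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset of the table, keyed by charset label."""
    return {label: encode_to_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("二").items():
        print(label, binary, hexadecimal)
```

For the single character 二, this sketch gives e4ba8c in UTF-8, 93f1 in SJIS-WIN, c6f3 in EUC-JP, and 3f in ISO-8859-1, the same byte values that appear for that character in the rows above.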