What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 埇??泣?┏銀?┓埇??泣?┏銀?┓^ 1111101010011010001111110011111110001011100000110011111110000100101011001000101111100010001111111000010010101101111110101001101000111111001111111000101110000011001111111000010010101100100010111110001000111111100001001010110101011110 fa9a3f3f8b833f84ac8be23f84adfa9a3f3f8b833f84ac8be23f84ad5e
EUC-JP 埇??泣?┏銀?┓埇??泣?┏銀?┓^ 10001111101101111110011100111111001111111011010111100011001111111010100010101110101101101110010000111111101010001010111110001111101101111110011100111111001111111011010111100011001111111010100010101110101101101110010000111111101010001010111101011110 8fb7e73f3fb5e33fa8aeb6e43fa8af8fb7e73f3fb5e33fa8aeb6e43fa8af5e
UTF-8 埇쎄내泣곫┏銀㏃┓埇쎄내泣곫┏銀㏃┓^ 11100101100111111000011111101100100011101000010011101011100000101011010011100110101100111010001111101010101100111010101111100010100101001000111111101001100010101000000011100011100011111000001111100010100101001001001111100101100111111000011111101100100011101000010011101011100000101011010011100110101100111010001111101010101100111010101111100010100101001000111111101001100010101000000011100011100011111000001111100010100101001001001101011110 e59f87ec8e84eb82b4e6b3a3eab3abe2948fe98a80e38f83e29493e59f87ec8e84eb82b4e6b3a3eab3abe2948fe98a80e38f83e294935e
UHC 埇쎄내泣곫┏銀㏃┓埇쎄내泣곫┏銀㏃┓^ 11101001101110011011110111101010101100111011101111101011111010001000000111100110101001101010111011101011110111101010011111101100101001101010111111101001101110011011110111101010101100111011101111101011111010001000000111100110101001101010111011101011110111101010011111101100101001101010111101011110 e9b9bdeab3bbebe881e6a6aeebdea7eca6afe9b9bdeab3bbebe881e6a6aeebdea7eca6af5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a character set cannot represent are shown as "?" (byte 3f).
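
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The helper names (`encode_row`, `encode_table`) are hypothetical, and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Unencodable characters are replaced with "?" to mirror the 3f bytes in the table. Results for individual characters may differ from the page wherever the two implementations use different mapping tables.

```python
# Hypothetical helpers: encode text in several charsets and render
# the resulting bytes as a binary string and a hexadecimal string.

# Assumed mapping from the page's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (bit string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("^").items():
        print(name, bits, hex_string)
```

Because "^" is an ASCII character, every charset above encodes it as the single byte 5e, which matches the trailing 5e in each row of the table.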