To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^}?????????^{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101111001111101001111110011111100111111001111110011111100111111001111110011111100111111010111100111101101011110 3f3f3f3f3f3f3f3f3f5e7d3f3f3f3f3f3f3f3f3f5e7b5e
SJIS-WIN ???陰??惟??^}???陰??惟??^{^ 001111110011111100111111100010010100000100111111001111111000100011010010001111110011111101011110011111010011111100111111001111111000100101000001001111110011111110001000110100100011111100111111010111100111101101011110 3f3f3f89413f3f88d23f3f5e7d3f3f3f89413f3f88d23f3f5e7b5e
EUC-JP ???陰??惟??^}???陰??惟??^{^ 001111110011111100111111101100011010001000111111001111111011000011010100001111110011111101011110011111010011111100111111001111111011000110100010001111110011111110110000110101000011111100111111010111100111101101011110 3f3f3fb1a23f3fb0d43f3f5e7d3f3f3fb1a23f3fb0d43f3f5e7b5e
UTF-8 捻뀀슙陰욜솾惟㏃뒡^}捻뀀슙陰욜솾惟㏃뒡^{^ 1110111110100110101001001110101110000000100000001110110010001010100110011110100110011001101100001110110010011010100111001110110010000110101111101110011010000011100111111110001110001111100000111110101110010010101000010101111001111101111011111010011010100100111010111000000010000000111011001000101010011001111010011001100110110000111011001001101010011100111011001000011010111110111001101000001110011111111000111000111110000011111010111001001010100001010111100111101101011110 efa6a4eb8080ec8a99e999b0ec9a9cec86bee6839fe38f83eb92a15e7defa6a4eb8080ec8a99e999b0ec9a9cec86bee6839fe38f83eb92a15e7b5e
UHC 捻뀀슙陰욜솾惟㏃뒡^}捻뀀슙陰욜솾惟㏃뒡^{^ 1110011011110111101100101110101110011010101001111110101111100100101111111110011110011001101100101110101011101110101001111110110010001010100111010101111001111101111001101111011110110010111010111001101010100111111010111110010010111111111001111001100110110010111010101110111010100111111011001000101010011101010111100111101101011110 e6f7b2eb9aa7ebe4bfe799b2eaeea7ec8a9d5e7de6f7b2eb9aa7ebe4bfe799b2eaeea7ec8a9d5e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)