To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 譯????????[譯????????[^ 1110011010100001001111110011111100111111001111110011111100111111001111110011111101011011111001101010000100111111001111110011111100111111001111110011111100111111001111110101101101011110 e6a13f3f3f3f3f3f3f3f5be6a13f3f3f3f3f3f3f3f5b5e
EUC-JP 譯?????蓀??[譯?????蓀??[^ 111011001010001100111111001111110011111100111111001111111000111111011000111110000011111100111111010110111110110010100011001111110011111100111111001111110011111110001111110110001111100000111111001111110101101101011110 eca33f3f3f3f3f8fd8f83f3f5beca33f3f3f3f3f8fd8f83f3f5b5e
UTF-8 譯롡굙栒귝퍗蓀곥뀆[譯롡굙栒귝퍗蓀곥뀆[^ 111010001010110110101111111010111010000110100001111010101011010110011001111001101010000010010010111010101011011110011101111011011000110110010111111010001001001110000000111010101011001110100101111010111000000010000110010110111110100010101101101011111110101110100001101000011110101010110101100110011110011010100000100100101110101010110111100111011110110110001101100101111110100010010011100000001110101010110011101001011110101110000000100001100101101101011110 e8adafeba1a1eab599e6a092eab79ded8d97e89380eab3a5eb80865be8adafeba1a1eab599e6a092eab79ded8d97e89380eab3a5eb80865b5e
UHC 譯롡굙栒귝퍗蓀곥뀆[譯롡굙栒귝퍗蓀곥뀆[^ 111001101011101110001110111000101000001010000001111000101110001110000010111001101011101110001110111000011110000010000001111000111000010110000010010110111110011010111011100011101110001010000010100000011110001011100011100000101110011010111011100011101110000111100000100000011110001110000101100000100101101101011110 e6bb8ee28281e2e382e6bb8ee1e081e385825be6bb8ee28281e2e382e6bb8ee1e081e385825b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)