To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???陰??癲??濡?????意э┷濡??^ 0011111100111111001111111000100101000001001111110011111111100001100111110011111100111111100101000100011100111111001111110011111100111111001111111000100011010011100001001000111110000100101110001001010001000111001111110011111101011110 3f3f3f89413f3fe19f3f3f94473f3f3f3f3f88d3848f84b894473f3f5e
EUC-JP ???陰??癲??濡?????意э┷濡??^ 0011111100111111001111111011000110100010001111110011111111100010101000010011111100111111110001111010100000111111001111110011111100111111001111111011000011010101101001111110111110101000101110101100011110101000001111110011111101011110 3f3f3fb1a23f3fe2a13f3fc7a83f3f3f3f3fb0d5a7efa8bac7a83f3f5e
UTF-8 溜깅젡陰먯뵮癲븍죭濡쀫젿溜싳넀意э┷濡덈죻^ 111011111010011110001011111010101011100110000101111011001010000010100001111010011001100110110000111010111010100010101111111010111011010110101110111001111001100110110010111010111011100010001101111011001010001110101101111001101011111110100001111011001000000010101011111011001010000010111111111011111010011110001011111011001000101110110011111010111000010010000000111001101000010010001111110100011000110111100010100101001011011111100110101111111010000111101011100011011000100011101100101000111011101101011110 efa78beab985eca0a1e999b0eba8afebb5aee799b2ebb88deca3ade6bfa1ec80abeca0bfefa78bec8bb3eb8480e6848fd18de294b7e6bfa1eb8d88eca3bb5e
UHC 溜깅젡陰먯뵮癲븍죭濡쀫젿溜싳넀意э┷濡덈죻^ 11101010111111101011000111101011101000001001101011101011111001001001000011101100100101001010110011101111101001101011101011101011101000011000100011101011101000011001011111101011101000001011000111101010111111101001101011101100100001101001000011101011111100101010110011101111101001101011101011101011101000011000100011101011101000011001010101011110 eafeb1eba09aebe490ec94acefa6baeba188eba197eba0b1eafe9aec8690ebf2acefa6baeba188eba1955e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)