To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????æ???}v?????æ???}vB 0011111100111111001111110011111100111111111001100011111100111111001111110111110101110110001111110011111100111111001111110011111111100110001111110011111100111111011111010111011001000010 3f3f3f3f3fe63f3f3f7d763f3f3f3f3fe63f3f3f7d7642
SJIS-WIN ?→?泣??業??}v?→?泣??業??}vB 0011111110000001101010000011111110001011100000110011111100111111100010111100011000111111001111110111110101110110001111111000000110101000001111111000101110000011001111110011111110001011110001100011111100111111011111010111011001000010 3f81a83f8b833f3f8bc63f3f7d763f81a83f8b833f3f8bc63f3f7d7642
EUC-JP 艅→?泣?æ業??}v艅→?泣?æ業??}vB 10001111110101101111110110100010101010100011111110110101111000110011111110001111101010011100000110110110110010000011111100111111011111010111011010001111110101101111110110100010101010100011111110110101111000110011111110001111101010011100000110110110110010000011111100111111011111010111011001000010 8fd6fda2aa3fb5e33f8fa9c1b6c83f3f7d768fd6fda2aa3fb5e33f8fa9c1b6c83f3f7d7642
UTF-8 艅→삜泣녔æ業잍츝}v艅→삜泣녔æ業잍츝}vB 111010001000100110000101111000101000011010010010111011001000001010011100111001101011001110100011111010111000010110010100110000111010011011100110101001011010110111101100100111101000110111101100101110001001110101111101011101101110100010001001100001011110001010000110100100101110110010000010100111001110011010110011101000111110101110000101100101001100001110100110111001101010010110101101111011001001111010001101111011001011100010011101011111010111011001000010 e88985e28692ec829ce6b3a3eb8594c3a6e6a5adec9e8decb89d7d76e88985e28692ec829ce6b3a3eb8594c3a6e6a5adec9e8decb89d7d7642
UHC 艅→삜泣녔æ業잍츝}v艅→삜泣녔æ業잍츝}vB 1110011010101001101000011110011010011000100111111110101111101000101100111110011010101001101000011110010111110110100111111110011010101110100101100111110101110110111001101010100110100001111001101001100010011111111010111110100010110011111001101010100110100001111001011111011010011111111001101010111010010110011111010111011001000010 e6a9a1e6989febe8b3e6a9a1e5f69fe6ae967d76e6a9a1e6989febe8b3e6a9a1e5f69fe6ae967d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)