To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???zW^???z\}v???zW^???z\}vB 001111110011111100111111011110100101011101011110001111110011111100111111011110100101110001111101011101100011111100111111001111110111101001010111010111100011111100111111001111110111101001011100011111010111011001000010 3f3f3f7a575e3f3f3f7a5c7d763f3f3f7a575e3f3f3f7a5c7d7642
SJIS-WIN 紆??zW^紆??z\}v紆??zW^紆??z\}vB 11100010111111000011111100111111011110100101011101011110111000101111110000111111001111110111101001011100011111010111011011100010111111000011111100111111011110100101011101011110111000101111110000111111001111110111101001011100011111010111011001000010 e2fc3f3f7a575ee2fc3f3f7a5c7d76e2fc3f3f7a575ee2fc3f3f7a5c7d7642
EUC-JP 紆??zW^紆??z\}v紆??zW^紆??z\}vB 11100100111111100011111100111111011110100101011101011110111001001111111000111111001111110111101001011100011111010111011011100100111111100011111100111111011110100101011101011110111001001111111000111111001111110111101001011100011111010111011001000010 e4fe3f3f7a575ee4fe3f3f7a5c7d76e4fe3f3f7a575ee4fe3f3f7a5c7d7642
UTF-8 紆숷숥zW^紆숷숥z\}v紆숷숥zW^紆숷숥z\}vB 111001111011010010000110111011001000100010110111111011001000100010100101011110100101011101011110111001111011010010000110111011001000100010110111111011001000100010100101011110100101110001111101011101101110011110110100100001101110110010001000101101111110110010001000101001010111101001010111010111101110011110110100100001101110110010001000101101111110110010001000101001010111101001011100011111010111011001000010 e7b486ec88b7ec88a57a575ee7b486ec88b7ec88a57a5c7d76e7b486ec88b7ec88a57a575ee7b486ec88b7ec88a57a5c7d7642
UHC 紆숷숥zW^紆숷숥z\}v紆숷숥zW^紆숷숥z\}vB 111010011110000110011010010011001001101001000010011110100101011101011110111010011110000110011010010011001001101001000010011110100101110001111101011101101110100111100001100110100100110010011010010000100111101001010111010111101110100111100001100110100100110010011010010000100111101001011100011111010111011001000010 e9e19a4c9a427a575ee9e19a4c9a427a5c7d76e9e19a4c9a427a575ee9e19a4c9a427a5c7d7642

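SJIS-WIN, EUC-JP: legacy charsets used mainly for Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP) respectively. Characters that a charset cannot represent appear as "?" (hex 3f) in that charset's row.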
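
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's actual implementation: the function name encode_table is hypothetical, and the mapping of the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) is an assumption. Unencodable characters are replaced with "?", which matches the 3f bytes seen in the ISO-8859-1 row.

```python
def encode_table(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary bit string, hex string) for each charset.

    The codec names are Python's; cp932 and cp949 are assumed to stand in
    for the SJIS-WIN and UHC labels used in the table.
    """
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks
        raw = text.encode(name, errors="replace")
        # Render every byte as 8 binary digits, concatenated without spaces
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hex_string in encode_table("紆zW^"):
        print(name, bits, hex_string)
```

Each bit string is exactly four times as long as its hex string, since one byte is written as eight binary digits or two hexadecimal digits.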