To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???W^???\}v???W^???\}vB 0011111100111111001111110101011101011110001111110011111100111111010111000111110101110110001111110011111100111111010101110101111000111111001111110011111101011100011111010111011001000010 3f3f3f575e3f3f3f5c7d763f3f3f575e3f3f3f5c7d7642
SJIS-WIN 瀚糀糀W^瀚糀糀\}v瀚糀糀W^瀚糀糀\}vB 1110000001101010111000101110111111100010111011110101011101011110111000000110101011100010111011111110001011101111010111000111110101110110111000000110101011100010111011111110001011101111010101110101111011100000011010101110001011101111111000101110111101011100011111010111011001000010 e06ae2efe2ef575ee06ae2efe2ef5c7d76e06ae2efe2ef575ee06ae2efe2ef5c7d7642
EUC-JP 瀚糀糀W^瀚糀糀\}v瀚糀糀W^瀚糀糀\}vB 1101111111001011111001001111000111100100111100010101011101011110110111111100101111100100111100011110010011110001010111000111110101110110110111111100101111100100111100011110010011110001010101110101111011011111110010111110010011110001111001001111000101011100011111010111011001000010 dfcbe4f1e4f1575edfcbe4f1e4f15c7d76dfcbe4f1e4f1575edfcbe4f1e4f15c7d7642
UTF-8 瀚糀糀W^瀚糀糀\}v瀚糀糀W^瀚糀糀\}vB 1110011110000000100110101110011110110011100000001110011110110011100000000101011101011110111001111000000010011010111001111011001110000000111001111011001110000000010111000111110101110110111001111000000010011010111001111011001110000000111001111011001110000000010101110101111011100111100000001001101011100111101100111000000011100111101100111000000001011100011111010111011001000010 e7809ae7b380e7b380575ee7809ae7b380e7b3805c7d76e7809ae7b380e7b380575ee7809ae7b380e7b3805c7d7642
UHC 瀚??W^瀚??\}v瀚??W^瀚??\}vB 111110011101010100111111001111110101011101011110111110011101010100111111001111110101110001111101011101101111100111010101001111110011111101010111010111101111100111010101001111110011111101011100011111010111011001000010 f9d53f3f575ef9d53f3f5c7d76f9d53f3f575ef9d53f3f5c7d7642

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
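
The conversion in the table can be approximated with a short script. The following is a minimal sketch in Python, not the code behind this page. The mapping from the table's charset names to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper names to_bit_strings and convert are hypothetical. Characters that a charset cannot represent are replaced with "?" (hex 3f), which matches the 3f bytes seen in the ISO-8859-1 row. Python's conversion tables may differ from this tool's for some characters.

```python
# Assumed mapping from the table's charset names to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset name: (binary, hex)} for every charset in CODECS."""
    return {name: to_bit_strings(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    for name, (binary, hexadecimal) in convert("A").items():
        print(name, binary, hexadecimal)
```

Each byte is written as eight bits, so the binary column is always four times as long as the hexadecimal column.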