To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????J}?????????J{^ 0011111100111111001111110011111100111111001111110011111100111111001111110100101001111101001111110011111100111111001111110011111100111111001111110011111100111111010010100111101101011110 3f3f3f3f3f3f3f3f3f4a7d3f3f3f3f3f3f3f3f3f4a7b5e
SJIS-WIN ?ゆ???????J}?ゆ???????J{^ 00111111100000101110010000111111001111110011111100111111001111110011111100111111010010100111110100111111100000101110010000111111001111110011111100111111001111110011111100111111010010100111101101011110 3f82e43f3f3f3f3f3f3f4a7d3f82e43f3f3f3f3f3f3f4a7b5e
EUC-JP ?ゆ????渶??J}?ゆ????渶??J{^ 0011111110100100111001100011111100111111001111110011111110001111110001111110110100111111001111110100101001111101001111111010010011100110001111110011111100111111001111111000111111000111111011010011111100111111010010100111101101011110 3fa4e63f3f3f3f8fc7ed3f3f4a7d3fa4e63f3f3f3f8fc7ed3f3f4a7b5e
UTF-8 獵ゆ뿃掠묊쉿渶싧뜥J}獵ゆ뿃掠묊쉿渶싧뜥J{^ 1110111110100110101001111110001110000010100001101110101110111111100000111110111110100101101101011110101110101100100010101110110010001001101111111110011010111000101101101110110010001011101001111110101110011100101001010100101001111101111011111010011010100111111000111000001010000110111010111011111110000011111011111010010110110101111010111010110010001010111011001000100110111111111001101011100010110110111011001000101110100111111010111001110010100101010010100111101101011110 efa6a7e38286ebbf83efa5b5ebac8aec89bfe6b8b6ec8ba7eb9ca54a7defa6a7e38286ebbf83efa5b5ebac8aec89bfe6b8b6ec8ba7eb9ca54a7b5e
UHC 獵ゆ뿃掠묊쉿渶싧뜥J}獵ゆ뿃掠묊쉿渶싧뜥J{^ 1110011110100110101010101110011010010111100010111110010110110001100100011110011110111101101100101110011110110111100110101110010110001101101010000100101001111101111001111010011010101010111001101001011110001011111001011011000110010001111001111011110110110010111001111011011110011010111001011000110110101000010010100111101101011110 e7a6aae6978be5b191e7bdb2e7b79ae58da84a7de7a6aae6978be5b191e7bdb2e7b79ae58da84a7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)