To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????\}?????????\{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101110001111101001111110011111100111111001111110011111100111111001111110011111100111111010111000111101101011110 3f3f3f3f3f3f3f3f3f5c7d3f3f3f3f3f3f3f3f3f5c7b5e
SJIS-WIN 鵝??兀?ⅹ言??\}鵝??兀?ⅹ言??\{^ 11101010010000000011111100111111100110010101100100111111111110100100100110001100101111100011111100111111010111000111110111101010010000000011111100111111100110010101100100111111111110100100100110001100101111100011111100111111010111000111101101011110 ea403f3f99593ffa498cbe3f3f5c7dea403f3f99593ffa498cbe3f3f5c7b5e
EUC-JP 鵝??兀??言??\}鵝??兀??言??\{^ 1111001110100001001111110011111111010001101110100011111100111111101110001100000000111111001111110101110001111101111100111010000100111111001111111101000110111010001111110011111110111000110000000011111100111111010111000111101101011110 f3a13f3fd1ba3f3fb8c03f3f5c7df3a13f3fd1ba3f3fb8c03f3f5c7b5e
UTF-8 鵝녶컟兀덂ⅹ言됧숱\}鵝녶컟兀덂ⅹ言됧숱\{^ 1110100110110101100111011110101110000101101101101110110010111011100111111110010110000101100000001110101110001101100000101110001010000101101110011110100010101000100000001110101110010000101001111110110010001000101100010101110001111101111010011011010110011101111010111000010110110110111011001011101110011111111001011000010110000000111010111000110110000010111000101000010110111001111010001010100010000000111010111001000010100111111011001000100010110001010111000111101101011110 e9b59deb85b6ecbb9fe58580eb8d82e285b9e8a880eb90a7ec88b15c7de9b59deb85b6ecbb9fe58580eb8d82e285b9e8a880eb90a7ec88b15c7b5e
UHC 鵝녶컟兀덂ⅹ言됧숱\}鵝녶컟兀덂ⅹ言됧숱\{^ 1110010010111101100001101110010110110000100010101110100010110100100010001110010110100101101010101110010111101011100010011110010110111101101000100101110001111101111001001011110110000110111001011011000010001010111010001011010010001000111001011010010110101010111001011110101110001001111001011011110110100010010111000111101101011110 e4bd86e5b08ae8b488e5a5aae5eb89e5bda25c7de4bd86e5b08ae8b488e5a5aae5eb89e5bda25c7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)