What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????U}?????????U{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101010101111101001111110011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f3f557b5e
SJIS-WIN 鵝??兀?ⅹ言??U}鵝??兀?ⅹ言??U{^ 11101010010000000011111100111111100110010101100100111111111110100100100110001100101111100011111100111111010101010111110111101010010000000011111100111111100110010101100100111111111110100100100110001100101111100011111100111111010101010111101101011110 ea403f3f99593ffa498cbe3f3f557dea403f3f99593ffa498cbe3f3f557b5e
EUC-JP 鵝??兀??言??U}鵝??兀??言??U{^ 1111001110100001001111110011111111010001101110100011111100111111101110001100000000111111001111110101010101111101111100111010000100111111001111111101000110111010001111110011111110111000110000000011111100111111010101010111101101011110 f3a13f3fd1ba3f3fb8c03f3f557df3a13f3fd1ba3f3fb8c03f3f557b5e
UTF-8 鵝녶컟兀덂ⅹ言됭퀕U}鵝녶컟兀덂ⅹ言됭퀕U{^ 1110100110110101100111011110101110000101101101101110110010111011100111111110010110000101100000001110101110001101100000101110001010000101101110011110100010101000100000001110101110010000101011011110110110000000100101010101010101111101111010011011010110011101111010111000010110110110111011001011101110011111111001011000010110000000111010111000110110000010111000101000010110111001111010001010100010000000111010111001000010101101111011011000000010010101010101010111101101011110 e9b59deb85b6ecbb9fe58580eb8d82e285b9e8a880eb90aded8095557de9b59deb85b6ecbb9fe58580eb8d82e285b9e8a880eb90aded8095557b5e
UHC 鵝녶컟兀덂ⅹ言됭퀕U}鵝녶컟兀덂ⅹ言됭퀕U{^ 1110010010111101100001101110010110110000100010101110100010110100100010001110010110100101101010101110010111101011100010011110100010110011100010100101010101111101111001001011110110000110111001011011000010001010111010001011010010001000111001011010010110101010111001011110101110001001111010001011001110001010010101010111101101011110 e4bd86e5b08ae8b488e5a5aae5eb89e8b38a557de4bd86e5b08ae8b488e5a5aae5eb89e8b38a557b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
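
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the page's own implementation: the function name `encode_report` is hypothetical, the Python codec names `cp932` and `cp949` are assumed stand-ins for SJIS-Win and UHC, and `errors="replace"` is an assumption chosen because it substitutes `?` (byte `3f`) for characters a charset cannot represent, which resembles the `3f` runs in the table.

```python
def encode_report(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Encode `text` in each charset and return (charset, binary, hex) rows.

    Hypothetical helper for illustration only. Characters that a charset
    cannot represent are replaced with '?' (0x3f) via errors="replace".
    """
    rows = []
    for name in charsets:
        data = text.encode(name, errors="replace")
        # Render every byte as 8 binary digits, concatenated without separators.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexstr in encode_report("U}"):
        print(name, bits, hexstr)
    # Each charset listed is ASCII-compatible, so every row shows
    # 0101010101111101 and 557d for the two ASCII characters "U}".
```

The binary column is just the hexadecimal column written out bit by bit, so the two always describe the same byte sequence.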