To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????U}?????????U{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101010101111101001111110011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f3f557b5e
SJIS-WIN 鵝??兀?ⅹ言??U}鵝??兀?ⅹ言??U{^ 11101010010000000011111100111111100110010101100100111111111110100100100110001100101111100011111100111111010101010111110111101010010000000011111100111111100110010101100100111111111110100100100110001100101111100011111100111111010101010111101101011110 ea403f3f99593ffa498cbe3f3f557dea403f3f99593ffa498cbe3f3f557b5e
EUC-JP 鵝??兀??言??U}鵝??兀??言??U{^ 1111001110100001001111110011111111010001101110100011111100111111101110001100000000111111001111110101010101111101111100111010000100111111001111111101000110111010001111110011111110111000110000000011111100111111010101010111101101011110 f3a13f3fd1ba3f3fb8c03f3f557df3a13f3fd1ba3f3fb8c03f3f557b5e
UTF-8 鵝녶컟兀덂ⅹ言됧숱U}鵝녶컟兀덂ⅹ言됧숱U{^ 1110100110110101100111011110101110000101101101101110110010111011100111111110010110000101100000001110101110001101100000101110001010000101101110011110100010101000100000001110101110010000101001111110110010001000101100010101010101111101111010011011010110011101111010111000010110110110111011001011101110011111111001011000010110000000111010111000110110000010111000101000010110111001111010001010100010000000111010111001000010100111111011001000100010110001010101010111101101011110 e9b59deb85b6ecbb9fe58580eb8d82e285b9e8a880eb90a7ec88b1557de9b59deb85b6ecbb9fe58580eb8d82e285b9e8a880eb90a7ec88b1557b5e
UHC 鵝녶컟兀덂ⅹ言됧숱U}鵝녶컟兀덂ⅹ言됧숱U{^ 1110010010111101100001101110010110110000100010101110100010110100100010001110010110100101101010101110010111101011100010011110010110111101101000100101010101111101111001001011110110000110111001011011000010001010111010001011010010001000111001011010010110101010111001011110101110001001111001011011110110100010010101010111101101011110 e4bd86e5b08ae8b488e5a5aae5eb89e5bda2557de4bd86e5b08ae8b488e5a5aae5eb89e5bda2557b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)