To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????U}??????U{^ 0011111100111111001111110011111100111111001111110101010101111101001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f557d3f3f3f3f3f3f557b5e
SJIS-WIN 谷遜則巽孫孫U}谷遜則巽孫孫U{^ 1001001001001010100100011011101110010001101001011001001001000110100100011011011110010001101101110101010101111101100100100100101010010001101110111001000110100101100100100100011010010001101101111001000110110111010101010111101101011110 924a91bb91a5924691b791b7557d924a91bb91a5924691b791b7557b5e
EUC-JP 谷遜則巽孫孫U}谷遜則巽孫孫U{^ 1100001110101011110000101011110111000010101001111100001110100111110000101011100111000010101110010101010101111101110000111010101111000010101111011100001010100111110000111010011111000010101110011100001010111001010101010111101101011110 c3abc2bdc2a7c3a7c2b9c2b9557dc3abc2bdc2a7c3a7c2b9c2b9557b5e
UTF-8 谷遜則巽孫孫U}谷遜則巽孫孫U{^ 1110100010110000101101111110100110000001100111001110010110001001100001111110010110110111101111011110010110101101101010111110010110101101101010110101010101111101111010001011000010110111111010011000000110011100111001011000100110000111111001011011011110111101111001011010110110101011111001011010110110101011010101010111101101011110 e8b0b7e9819ce58987e5b7bde5adabe5adab557de8b0b7e9819ce58987e5b7bde5adabe5adab557b5e
UHC 谷遜則巽孫孫U}谷遜則巽孫孫U{^ 1100110111011011111000011110000111110110110011101110000111011110111000011101110111100001110111010101010101111101110011011101101111100001111000011111011011001110111000011101111011100001110111011110000111011101010101010111101101011110 cddbe1e1f6cee1dee1dde1dd557dcddbe1e1f6cee1dee1dde1dd557b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)