To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 俑ョズ娃??歪??絶??節??暎⑨?歪?? 10011000110110101000001110000111100000110101100110001000101000010011111100111111100110000110001100111111001111111001000011100010001111110011111110010000110111110011111100111111100111011111001110000111010010000011111110011000011000110011111100111111 98da8387835988a13f3f98633f3f90e23f3f90df3f3f9df387483f98633f3f
EUC-JP 俑ョズ娃??歪??絶??節??暎??歪?? 110100001101110010100101111001111010010110111010101100001010001100111111001111111100111111000100001111110011111111000000111001000011111100111111110000001110000100111111001111111101101011110101001111110011111111001111110001000011111100111111 d0dca5e7a5bab0a33f3fcfc43f3fc0e43f3fc0e13f3fdaf53f3fcfc43f3f
UTF-8 俑ョズ娃숋슨歪뉛쉰絶뽭퐡節긺댚暎⑨풗歪뉛숱 111001001011111110010001111000111000001110100111111000111000001010111010111001011010100010000011111011001000100010001011111011001000101010101000111001101010110110101010111010111000100110011011111011001000100110110000111001111011010110110110111010111011110110101101111011011001000010100001111001111010111110000000111010101011100010111010111010111000110010011010111001101001101010001110111000101001000110101000111011011001001010010111111001101010110110101010111010111000100110011011111011001000100010110001 e4bf91e383a7e382bae5a883ec888bec8aa8e6adaaeb899bec89b0e7b5b6ebbdaded90a1e7af80eab8baeb8c9ae69a8ee291a8ed9297e6adaaeb899bec88b1
UHC 俑ョズ娃숋슨歪뉛쉰絶뽭퐡節긺댚暎⑨풗歪뉛숱 111010011011010110101011111001111010101110111010111010001101111110011001111011111011110110111100111010001110000010000111111011111011110110101110111011111011111010010110111010011011110110001010111011111011110110110001111001111000100010111110111001111011001010101000111011111011111010011010111010001110000010000111111011111011110110100010 e9b5abe7abbae8df99efbdbce8e087efbdaeefbe96e9bd8aefbdb1e788bee7b2a8efbe9ae8e087efbda2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)