To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 諺?????嚥??}v諺?????嚥??}vB 100011001011111100111111001111110011111100111111001111111001101010001011001111110011111101111101011101101000110010111111001111110011111100111111001111110011111110011010100010110011111100111111011111010111011001000010 8cbf3f3f3f3f3f9a8b3f3f7d768cbf3f3f3f3f3f9a8b3f3f7d7642
EUC-JP 諺?????嚥??}v諺?????嚥??}vB 101110001100000100111111001111110011111100111111001111111101001111101011001111110011111101111101011101101011100011000001001111110011111100111111001111110011111111010011111010110011111100111111011111010111011001000010 b8c13f3f3f3f3fd3eb3f3f7d76b8c13f3f3f3f3fd3eb3f3f7d7642
UTF-8 諺뚰씗溜뀀젎嚥잙젵}v諺뚰씗溜뀀젎嚥잙젵}vB 1110100010101011101110101110101110011010101100001110110010010100100101111110111110100111100010111110101110000000100000001110110010100000100011101110010110011010101001011110110010011110100110011110110010100000101101010111110101110110111010001010101110111010111010111001101010110000111011001001010010010111111011111010011110001011111010111000000010000000111011001010000010001110111001011001101010100101111011001001111010011001111011001010000010110101011111010111011001000010 e8abbaeb9ab0ec9497efa78beb8080eca08ee59aa5ec9e99eca0b57d76e8abbaeb9ab0ec9497efa78beb8080eca08ee59aa5ec9e99eca0b57d7642
UHC 諺뚰씗溜뀀젎嚥잙젵}v諺뚰씗溜뀀젎嚥잙젵}vB 1110010111101100100011001110110110011101101011001110101011111110101100101110101110100000100011111110011010111111100111111110101110100000101010010111110101110110111001011110110010001100111011011001110110101100111010101111111010110010111010111010000010001111111001101011111110011111111010111010000010101001011111010111011001000010 e5ec8ced9daceafeb2eba08fe6bf9feba0a97d76e5ec8ced9daceafeb2eba08fe6bf9feba0a97d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)