Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 厓る????嚥??v厓る????嚥??vB 111110101000110110000010111010010011111100111111001111110011111110011010100010110011111100111111011101101111101010001101100000101110100100111111001111110011111100111111100110101000101100111111001111110111011001000010 fa8d82e93f3f3f3f9a8b3f3f76fa8d82e93f3f3f3f9a8b3f3f7642
EUC-JP 厓る????嚥??v厓る????嚥??vB 1000111110110100110001111010010011101011001111110011111100111111001111111101001111101011001111110011111101110110100011111011010011000111101001001110101100111111001111110011111100111111110100111110101100111111001111110111011001000010 8fb4c7a4eb3f3f3f3fd3eb3f3f768fb4c7a4eb3f3f3f3fd3eb3f3f7642
UTF-8 厓る젗蓮쇰젲嚥잙젺v厓る젗蓮쇰젲嚥잙젺vB 111001011000111010010011111000111000001010001011111011001010000010010111111011111010011010011001111011001000011110110000111011001010000010110010111001011001101010100101111011001001111010011001111011001010000010111010011101101110010110001110100100111110001110000010100010111110110010100000100101111110111110100110100110011110110010000111101100001110110010100000101100101110010110011010101001011110110010011110100110011110110010100000101110100111011001000010 e58e93e3828beca097efa699ec87b0eca0b2e59aa5ec9e99eca0ba76e58e93e3828beca097efa699ec87b0eca0b2e59aa5ec9e99eca0ba7642
UHC 厓る젗蓮쇰젲嚥잙젺v厓る젗蓮쇰젲嚥잙젺vB 111001001110110110101010111010111010000010010011111001101110010110111100111010111010000010100110111001101011111110011111111010111010000010101101011101101110010011101101101010101110101110100000100100111110011011100101101111001110101110100000101001101110011010111111100111111110101110100000101011010111011001000010 e4edaaeba093e6e5bceba0a6e6bf9feba0ad76e4edaaeba093e6e5bceba0a6e6bf9feba0ad7642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
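
The same kind of conversion can be sketched in Python. This is a minimal illustration, not the code behind this page: the helper name `to_bit_strings` is hypothetical, and the mapping from the charset labels in the table to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table shows for ISO-8859-1.

```python
# Assumed mapping from the charset labels used in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return binary, data.hex()


# Print one row per charset for a sample character.
for label, codec in CODECS.items():
    binary, hexadecimal = to_bit_strings("る", codec)
    print(label, binary, hexadecimal)
```

Each byte is rendered as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.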