To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 怏?????蘂??v怏?????蘂??vB 10011100100010010011111100111111001111110011111100111111111001010100000100111111001111110111011010011100100010010011111100111111001111110011111100111111111001010100000100111111001111110111011001000010 9c893f3f3f3f3fe5413f3f769c893f3f3f3f3fe5413f3f7642
EUC-JP 怏?????蘂??v怏?????蘂??vB 11010111111010010011111100111111001111110011111100111111111010011010001000111111001111110111011011010111111010010011111100111111001111110011111100111111111010011010001000111111001111110111011001000010 d7e93f3f3f3f3fe9a23f3f76d7e93f3f3f3f3fe9a23f3f7642
UTF-8 怏묆끃溜욌젎蘂뚮젩v怏묆끃溜욌젎蘂뚮젩vB 111001101000000010001111111010111010110010000110111010111000000110000011111011111010011110001011111011001001101010001100111011001010000010001110111010001001100010000010111010111001101010101110111011001010000010101001011101101110011010000000100011111110101110101100100001101110101110000001100000111110111110100111100010111110110010011010100011001110110010100000100011101110100010011000100000101110101110011010101011101110110010100000101010010111011001000010 e6808febac86eb8183efa78bec9a8ceca08ee89882eb9aaeeca0a976e6808febac86eb8183efa78bec9a8ceca08ee89882eb9aaeeca0a97642
UHC 怏묆끃溜욌젎蘂뚮젩v怏묆끃溜욌젎蘂뚮젩vB 111001001110100010010001111000111000010110111001111010101111111010011110111010111010000010001111111001111101111010001100111010111010000010100001011101101110010011101000100100011110001110000101101110011110101011111110100111101110101110100000100011111110011111011110100011001110101110100000101000010111011001000010 e4e891e385b9eafe9eeba08fe7de8ceba0a176e4e891e385b9eafe9eeba08fe7de8ceba0a17642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)