To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN 汝??秧??耶??z汝??秧??耶??zB 100100111111000000111111001111111110001001011110001111110011111110010110111010110011111100111111011110101001001111110000001111110011111111100010010111100011111100111111100101101110101100111111001111110111101001000010 93f03f3fe25e3f3f96eb3f3f7a93f03f3fe25e3f3f96eb3f3f7a42
EUC-JP 汝??秧??耶??z汝??秧??耶??zB 110001101111001000111111001111111110001110111111001111110011111111001100111011010011111100111111011110101100011011110010001111110011111111100011101111110011111100111111110011001110110100111111001111110111101001000010 c6f23f3fe3bf3f3fcced3f3f7ac6f23f3fe3bf3f3fcced3f3f7a42
UTF-8 汝싧춼秧녔짎耶섆퓘z汝싧춼秧녔짎耶섆퓘zB 111001101011000110011101111011001000101110100111111011001011011010111100111001111010011110100111111010111000010110010100111011001010011110001110111010001000000010110110111011001000010010000110111011011001001110011000011110101110011010110001100111011110110010001011101001111110110010110110101111001110011110100111101001111110101110000101100101001110110010100111100011101110100010000000101101101110110010000100100001101110110110010011100110000111101001000010 e6b19dec8ba7ecb6bce7a7a7eb8594eca78ee880b6ec8486ed93987ae6b19dec8ba7ecb6bce7a7a7eb8594eca78ee880b6ec8486ed93987a42
UHC 汝싧춼秧녔짎耶섆퓘z汝싧춼秧녔짎耶섆퓘zB 111001101010001110011010111001011010110110011000111001001110101110110011111001101010001110011010111001011010110110011000111001001011111110000011011110101110011010100011100110101110010110101101100110001110010011101011101100111110011010100011100110101110010110101101100110001110010010111111100000110111101001000010 e6a39ae5ad98e4ebb3e6a39ae5ad98e4bf837ae6a39ae5ad98e4ebb3e6a39ae5ad98e4bf837a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)